ORIGINAL RESEARCH article

Front. Genet., 28 September 2022

Sec. RNA

Volume 13 - 2022 | https://doi.org/10.3389/fgene.2022.975521

Integration of three machine learning algorithms identifies characteristic RNA binding proteins linked with diagnosis, immunity and pyroptosis of IgA nephropathy

  • 1. Department of Nephrology, People’s Hospital of Xinjiang Uygur Autonomous Region, Urumqi, China

  • 2. Department of Cardiology, People’s Hospital of Xinjiang Uygur Autonomous Region, Urumqi, China

  • 3. Department of Nephrology, The First Affiliated Hospital of Xinjiang Medical University, Urumqi, China

Article metrics

View details

5

Citations

2,6k

Views

1,3k

Downloads

Abstract

Objective: RNA-binding proteins (RBPs) are essential for most post-transcriptional regulatory events, which exert critical roles in nearly all aspects of cell biology. Here, characteristic RBPs of IgA nephropathy were determined with multiple machine learning algorithms.

Methods: Our study included three gene expression datasets of IgA nephropathy (GSE37460, GSE73953, GSE93798). Differential expression of RBPs between IgA nephropathy and normal samples was analyzed via limma, and hub RBPs were determined through MCODE. Afterwards, three machine learning algorithms (LASSO, SVM-RFE, random forest) were integrated to determine characteristic RBPs, which were verified in the Nephroseq database. Immune cell infiltrations were estimated through CIBERSORT. Utilizing ConsensusClusterPlus, IgA nephropathy were classified based on hub RBPs. The potential upstream miRNAs were predicted.

Results: Among 388 RBPs with differential expression, 43 hub RBPs were determined. After integration of three machine learning algorithms, three characteristic RBPs were finally identified (DDX27, RCL1, and TFB2M). All of them were down-regulated in IgA nephropathy than normal specimens, with the excellent diagnostic efficacy. Additionally, they were significantly linked to immune cell infiltrations, immune checkpoints, and pyroptosis-relevant genes. Based on hub RBPs, IgA nephropathy was stably classified as two subtypes (cluster 1 and 2). Cluster 1 exhibited the relatively high expression of pyroptosis-relevant genes and characteristic RBPs. MiR-501-3p, miR-760, miR-502-3p, miR-1224-5p, and miR-107 were potential upstream miRNAs of hub RBPs.

Conclusion: Collectively, our findings determine three characteristic RBPs in IgA nephropathy and two RBPs-based subtypes, and thus provide a certain basis for further research on the diagnosis and pathogenesis of IgA nephropathy.

Introduction

Immunoglobulin A (IgA) nephropathy is the most frequent form of primary glomerulonephritis globally (Moldoveanu et al., 2021). About one-third of IgA nephropathy patients will develop end-stage renal disease within 20 years after diagnosis by kidney biopsy (Zeng et al., 2021). IgA nephropathy has been an important cause of end-stage renal disease among young adults. The predominant histological characteristics are immune deposits dominated by granular diffuse IgA (primarily comprising polymeric IgA1) in the mesangial region, usually linked to increased mesangial cells along with matrix expansion (Xie et al., 2021). In China, IgA nephropathy occupies 45.26% of primary glomerular diseases, and remains the most common cause of uremia (26.69%) (Li Y. et al., 2021). Currently, the comprehension of the pathophysiology of IgA nephropathy remains undefined, which involves multiple potential players (composed of mucosal immune system, complement system, microbiome, etc.) (Suzuki and Novak, 2021). Nevertheless, the absence of gene models to diagnose IgA nephropathy limits personalized risk-based therapeutic options.

RNA binding proteins (RBPs) exert critical roles in nearly all aspects of cell biology (Sternburg and Karginov, 2020). They orchestrate post-transcriptional regulatory events of gene expression (messenger RNA (mRNA) splicing, RNA stability, translation, etc.) (Wu and Xu, 2022). RBPs act as repressors or activators when interacting with mRNAs, and their binding sites are broad ranging from 5′-UTR to 3′-UTR. The most updated human RBP catalog comprises of 1,542 genes (Supplementary Table S1). Several RBPs have been proven to correlate with innate immune response and various programmed cell death types especially pyroptosis (Zheng and Kanneganti, 2020). Altered expression and dysfunctions of RBPs result in IgA nephropathy progression (Hahn et al., 2010; Wu et al., 2020). In the present study, three machine learning algorithms (LASSO, SVM-RFE, random forest) were integrated to determine characteristic RBPs in IgA nephropathy as well as developed two RBPs-based subtypes, offering a certain basis for further research on the diagnosis and pathogenesis of IgA nephropathy. Figure 1 illustrates the overall design of our study.

FIGURE 1

Materials and methods

Data collection

This study retrospectively included four gene expression datasets of IgA nephropathy from the Gene Expression Omnibus (GEO) repository (https://www.ncbi.nlm.nih.gov/gds/), including GSE37460 (Guo et al., 2019), GSE73953 (Nagasawa et al., 2016), and GSE93798 (Liu et al., 2017). The raw transcriptomic data from the Affymetrix platform were pre-processed with robust multiarray averaging method derived from Affy package (Gautier et al., 2004). The batch effects from different datasets were eliminated by ComBat method from surrogate variable analysis (sva) package (Leek et al., 2012). Figures 2A,B depicted the principal component analysis (PCA) before and after batch correction. Probe IDs were mapped to gene symbols on the basis of the corresponding annotation files, and the expression values of all probes matching the same gene were averaged as the final value. Additionally, we retrieved microRNA (miRNA) expression profiling of IgA nephropathy from GSE25590 dataset (Serino et al., 2012).

FIGURE 2

Identification of RBPs with differential expression

Differential expression of RBPs between IgA nephropathy and normal samples was analyzed through linear models for microarray data (limma) (Ritchie et al., 2015). False discovery rate (FDR)<0.05 was set as the cut-off criterion. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis of RBPs with differential expression was executed utilizing clusterProfiler package (Yu et al., 2012).

Analysis of protein-protein interaction (PPI)

Interaction between RBPs with differential expression was probed utilizing the Search Tool for the Retrieval of Interacting Genes (STRING) online database according to the default criteria. Through Cytoscape plugin Molecular Complex Detection (MCODE), the important modules in the PPI network were visualized, and the hub RBPs were identified for subsequent analysis (Shannon et al., 2003).

Screening characteristic RBPs

Three machine learning algorithms were applied for selecting characteristic RBPs. Hub RBPs were utilized for establishing a penalized multivariate Cox proportional hazard survival model through variable selection on the basis of L1-penalized Least Absolute Shrinkage and Selection Operator (LASSO) regression approach from glmnet package, with 10-fold cross-validation (Goeman, 2010). Support Vector Machine-Recursive Feature Elimination (SVM-RFE) method was employed to search for lambda with the smallest classification error to determine the variables. Random forest algorithm was implemented for generating decision tree forest with 10-fold cross-validation. Characteristic RBPs analyzed by above algorithms were intersected.

Gene set enrichment analysis (GSEA)

GSEA was executed for determining the significant functional terms between groups (Subramanian et al., 2005). The “c2. cp.kegg.v7.5. symbols.gmt” was downloaded from the Molecular Signatures Database (MSigDB), as the reference gene set (Liberzon et al., 2015). The gene set was regarded as significant enrichment if FDR<0.05.

Estimation of immune cell infiltrations and immune checkpoints

CIBERSORT, a deconvolution algorithm, was execute to quantify 24 immune cell types via applying 547 gene expression signatures (Newman et al., 2015). The permutations were set as 100. Samples with p < 0.05 reflected that the deconvolution results were relatively reliable, which were included for subsequent analysis. We also collected common immune checkpoints from published literature (IDO1, LAG3, CTLA4, TNFRSF9, ICOS, CD80, PDCD1LG2, CD70, TNFSF9, KIR3DL1, CD86, PDCD1, LAIR1, TNFRSF8, TNFSF15, TNFRSF14, CD40, TNFRSF4, TNFSF14, HHLA2, CD244, CD27, LGALS9, CD28, CD48, TNFRSF25, CD40LG, VTCN1, CD160, CD44, TNFSF18, BTNL2, TNFSF4, CD200, and NRP1.

Consensus clustering analysis

Through ConsensusClusterPlus package (Wilkerson and Hayes, 2010), a consistency matrix of IgA nephropathy samples was established based on the expression profiling of hub RBPs according to item subsampling = 0.8, feature subsampling = 0.8, distance = 1—Pearson correlation, iteration = 1,000, and the maximum k value = 9. The optimal number of clusters was determined through consensus CDF, tracking plot, and consensus matrix. Principal component analysis (PCA) was conducted to verify this classification.

Gene set variation analysis (GSVA)

A single sample gene set enrichment analysis (ssGSEA) was executed for estimating the enrichment score of the specified gene signature utilizing GSVA package, with the “c2. cp.kegg.v7.5. symbols.gmt” as the reference gene set (Hänzelmann et al., 2013). The enrichment score was compared between subtypes via limma approach.

External validation

The expression of characteristic RBPs was externally verified in the Nephroseq database (http://v5.nephroseq.org/) that combines a large number of publicly available renal transcriptome profiling. Associations between characteristic RBPs and clinical features were also evaluated.

Establishment of a miRNA-hub RBP network

The Encyclopedia of RNA interactomes (ENCORI) integrates eight distinct databases for predicting miRNA-mRNA interactions. Here, four databases (Targetscan, PITA, PicTar, and miRanda) were utilized. MiRNAs were regarded as upstream miRNAs of hub RBPs if the results appeared in ≥2 databases. Afterwards, a miRNA-hub RBP network was visualized via cytoscape software.

Statistical analysis

All statistical analysis was executed with R software (version 3.6.3). Continuous variables that fit normal distribution between binary groups were compared utilizing student’s t-test. Otherwise, Mann-Whitney U test was carried out. Receiver operating characteristic curves (ROCs) were plotted to evaluate the diagnostic efficacy of characteristic RBPs in IgA nephropathy. Associations between variables were evaluated with Pearson or Spearman coefficients. The significance was set as p < 0.05, and all statistical tests were two-sided.

Results

Identification of hub RBPs in IgA nephropathy

A total of 388 RBPs with differential expression were identified according to FDR<0.05 (Figures 2C,D; Supplementary Table S2). GO enrichment results (nucleic acid binding, RNA binding, mRNA binding, AU-rich element binding, mRNA 3′-UTR binding, etc.) revealed the primary biological functions of these RBPs (Figures 2E–G). Additionally, spliceosome, RNA transport, mRNA surveillance pathway, ribosome biogenesis in eukaryotes, RNA degradation and aminoacyl-tRNA biosynthesis pathways were significantly enriched by these RBPs with differential expression (Figure 2H). A total of 43 hub RBPs were identified, including DDX56, UTP3, MPHOSPH10, DDX10, CEBPZ, TRMT11, NOP14, RRP9, PA2G4, NGDN, KRR1, UTP14A, MAK16, UTP18, DDX27, ABT1, DDX24, DNTTIP2, NOP2, IMP3, EXOSC10, DHX32, RRP1, RRP1B, TFB2M, GNL2, RRP7A, NOC4L, LSG1, DDX17, EIF4A3, GNL3, DKC1, RSL1D1, NOL6, DDX31, DDX5, TRMT1, NOLC1, RBM34, RCL1, PNO1, DDX18 (Figure 2I).

Identification of characteristic RBPs in IgA nephropathy via integrating three machine learning algorithms

Three machine learning algorithms were applied for identifying characteristic RBPs in IgA nephropathy. According to LASSO model, 15 characteristic genes were identified, composed of DDX56, DDX10, UTP14A, DDX27, DDX24, DNTTIP2, IMP3, EXOSC10, TFB2M, RRP7A, GNL3, RSL1D1, DDX31, TRMT1, and RCL1 (Figures 3A,B). SVM-RFE analysis identified 10 characteristic genes including TFB2M, RCL1, RSL1D1, RRP7A, DDX56, DDX27, NGDN, DDX24, TRMT1, and CEBPZ (Figure 3C). Additionally, three characteristic genes were determined based on random forest approach, comprising TFB2M, DDX27, and RCL1 (Figures 3D,E). After integration of above machine learning algorithms, we finally determined TFB2M, DDX27, and RCL1 as characteristic RBPs of IgA nephropathy (Figure 3F). Compared with normal specimens, TFB2M, DDX27, and RCL1 expressions were significantly down-regulated in IgA nephropathy (Figure 3G).

FIGURE 3

Characteristic RBPs as reliable diagnostic markers of IgA nephropathy

To evaluate the diagnostic efficacy of the characteristic RBPs in IgA nephropathy, ROCs were plotted. As a result, the AUC values of DDX27 (Figure 4A), RCL1 (Figure 4B), and TFB2M (Figure 4C) were separately 0.82, 0.78, and 0.85. Above analysis demonstrated that the characteristic RBPs might be reliable diagnostic markers of IgA nephropathy.

FIGURE 4

Signaling pathways involved in characteristic RBPs

DDX27 was positively correlated to histidine metabolism, glyoxylate and dicarboxylate metabolism, β-alanine metabolism, glycine serine and threonine metabolism, pantothenate and coA biosynthesis, and drug metabolism other enzymes (Figure 4D). In Figure 4E, RCL1 was negatively linked to steroid hormone biosynthesis, pentose and glucuronate interconversions, histidine metabolism, glycerolipid metabolism, nitrogen metabolism and lysine degradation. Moreover, TFB2M exhibited positive associations with pentose and glucuronate interconversions, steroid hormone biosynthesis, drug metabolism other enzymes, cysteine and methionine metabolism, starch and sucrose metabolism and drug metabolism cytochrome P450 (Figure 4F).

Characteristic RBPs are linked to immune cell infiltrations and pyroptosis in IgA nephropathy

Through CIBERSORT approach, we quantified the enrichment levels of 24 immune cell types. As illustrated in Figure 5A, there were dramatic interactions between immune cells. Compared with normal specimens, CD8 T cells, follicular helper T cells, gamma delta T cells, resting NK cells, activated NK cells, M1 macrophages, M2 macrophages, resting dendritic cells, activated dendritic cells, endothelial cells and fibroblasts exhibited higher enrichment levels in IgA nephropathy (Figure 5B). We also evaluated the correlations between characteristic RBPs and immune cell infiltrations. In Figure 5C, DDX27 was positively associated with memory resting CD4 T cells, but was negatively associated with endothelial cells, M2 macrophages, gamma delta T cells, fibroblasts, M1 macrophages, resting dendritic cells, CD8 T cells, and monocytes. RCL1 exhibited positive interactions with regulatory T cells (Tregs), follicular helper T cells, memory activated CD4 T cells, resting mast cells, memory B cells, but exhibited negative interactions with endothelial cells, fibroblasts, M1 macrophages, and M2 macrophages (Figure 5D). Additionally, TFB2M was negatively linked to endothelial cells, fibroblasts, M1 macrophages, M2 macrophages, activated dendritic cells, activated NK cells, gamma delta T cells, resting NK cells, resting dendritic cells, CD8 T cells, naïve B cells, and M0 macrophages (Figure 5E). Pyroptosis is a type of cell death, that is, crucial for immunity (Niu et al., 2022). Here, the relationships of characteristic RBPs with pyroptosis-relevant genes (DDX27, RCL1, and TFB2M) were analyzed. Among them, TFB2M displayed significantly negative correlations to NOD2, NLRP1, TP53, CASP4, and TNF (Figures 5F–H).

FIGURE 5

IgA nephropathy is classified as two subtypes based on hub RBPs

Through consensus clustering approach, IgA nephropathy specimens were stably classified as two subtypes, namely cluster 1 and 2 (Figures 6A–D; Supplementary Table S3). PCA plot also demonstrated the difference between subtypes (Figure 6E). Additionally, most hub RBPs exhibited higher expressions in cluster 1 than 2 (Figure 6F).

FIGURE 6

Differences in signaling pathways, immune cell infiltrations, immune checkpoints and pyroptosis between hub RBPs-based subtypes

In Figure 7A, cytosolic DNA sensing pathway, proteasome, RNA degradation, nucleotide excision repair, mismatch repair, DNA replication, spliceosome, basal transcription factors and RNA polymerase exhibited higher enrichment levels in cluster 1 than 2. Oppositely, linoleic acid metabolism, olfactory transduction, aldosterone regulation sodium reabsorption, folate biosynthesis, renin angiotensin system, PPAR signaling pathway, pyruvate metabolism, citrate cycle TCA cycle, valine leucine and isoleucine degradation, proximal tubule bicarbonate reclamation, drug metabolism cytochrome P450, tyrosine metabolism, peroxisome, phenylalanine metabolism, glycine serine and threonine metabolism, arginine and proline metabolism, glycolysis gluconeogenesis, β-alanine metabolism, limonene and pinene degradation, steroid hormone biosynthesis, retinol metabolism and nitrogen metabolism had higher enrichment levels in cluster 2 in comparison to cluster 1. Compared with cluster 1, cluster 2 was characterized by lower infiltrations of memory B cells, and higher infiltrations of plasma cells, M2 macrophages, activated mast cells and neutrophils (Figure 7B). Additionally, immune checkpoints IDO1, CD86, LAIR1, TNFRSF14, TNFRSF4, CD48, TNFRSF25, CD44, and NRP1 were up-regulated in IgA nephropathy than normal specimens (Figure 7C). Differently, down-regulated LAG3, CTLA4, TNFSF9, PDCD1, TNFRSF8, HHLA2, CD40LG, BTNL2, TNFSF4, and CD200 were found in IgA nephropathy. In Figure 7D, cluster 1 was characterized by higher expressions of IDO1, TNFRSF8, TNFRSF14, CD40, CD48, CD44 and NRP1 than cluster 2. Most pyroptosis-relevant genes exhibited higher expression in IgA nephropathy versus normal specimens (Figure 7E), indicating the activation of pyroptosis pathway in IgA nephropathy. Afterwards, the heterogeneity in pyroptosis was measured between subtypes. In comparison to cluster 1, cluster 2 had lower expression of pyroptosis-relevant genes, indicating higher activity of pyroptosis in cluster 1 (Figure 7F).

FIGURE 7

Association of characteristic RBPs with clinical features, hub RBPs-based subtypes and immune cell infiltrations

Characteristic RBPs were further verified in the Nephroseq database. DDX27 was significantly up-regulated in chronic kidney disease than normal kidneys, and was negatively correlated to body mass index (Figures 8A,B). Additionally, up-regulated TFB2M was examined in chronic kidney disease compared with normal kidneys (Figure 8C). Moreover, DDX27, RCL1, and TFB2M expressions were relatively higher in cluster 1 than 2 (Figure 8D). All of them were positively correlated to most immune checkpoints in IgA nephropathy samples (Figure 8E).

FIGURE 8

Upstream miRNAs of hub RBPs

We further predicted the upstream miRNAs of hub RBPs. Firstly, we determined miRNAs with differential expression in IgA nephropathy from GSE25590 dataset (Table 1). In Figure 8F, miR-501-3p, miR-760, miR-502-3p, miR-1224-5p, and miR-107 were potential upstream miRNAs of hub RBPs after prediction.

TABLE 1

MiRNAslogFCMean expressiontp-valueFDR
miR-107−2321.433942.143−3.393830.0050610.046251
miR-502-5p−18.620734.11107−3.490960.0042160.039431
miR-483-5p−57.021462.98214−3.505240.0041040.038844
miR-362-5p−92.4143221.6143−3.531170.0039090.037443
let-7g−14493.621030.36−3.579420.0035710.035905
miR-20a−4168.577802.143−3.575680.0035960.035905
let-7i−5956.439032.5−3.631290.003240.033456
miR-374b−274.786384.8929−3.653380.0031090.03333
miR-20b−621.5711,068.571−3.646120.0031520.03333
miR-564−17.421428.675−3.640290.0031860.03333
miR-765−23.416424.74893−3.680640.0029550.032779
miR-185−342.464552.0536−3.761670.002540.029271
miR-134−38.842951.52857−3.780160.0024550.028697
let-7f−8376.1412616.93−3.874170.0020620.026242
miR-500−10.287118.29929−3.850990.0021520.026242
miR-132−33.515740.95643−3.848590.0021620.026242
miR-93−649.364998.6036−3.844380.0021790.026242
miR-15b−7184.2912296.43−3.889470.0020040.026213
miR-500*−45.714364.81429−3.930330.0018580.025039
miR-22−2107.145074.286−3.957750.0017670.024474
miR-23a−29305763.571−3.975190.0017110.024285
miR-103−4019.436833.143−3.997840.0016410.024155
miR-505*−24.997130.57286−4.0110.0016020.024023
miR-16−16049.331073.93−4.028550.0015510.023707
miR-155−412.071809.75−4.129350.0012890.020916
miR-98−165.936218.1036−4.238310.0010570.018676
miR-150*−21.501424.05643−4.276240.0009870.017831
miR-22*−17.384320.66214−4.318550.0009140.016902
miR-768-5p_v11.0−247.293294.6393−4.470320.0006960.014804
miR-18a−160.343218.7−4.457120.0007130.014804
miR-886-3p−31.026431.83679−4.446510.0007260.014804
let-7b−5087.866656.071−4.563110.000590.013556
miR-221*−45.178662.58929−4.556520.0005970.013556
miR-365−42.828660.47857−4.629460.0005240.013304
miR-200b−14.1426.88357−4.617040.0005360.013304
miR-18b−37.771458.52857−4.680060.000480.013129
let-7a−11010.715253.21−4.661640.0004950.013129
miR-24−3990.716935.357−4.925720.0003120.011569
miR-939−41.4357112.7821−4.845190.0003590.011569
miR-30b*−11.101418.99−4.821640.0003740.011569
miR-425−833.1431,305.571−4.793740.0003930.011569
miR-760−11.25569.521464−5.751947.84E-050.009833
miR-574-3p−40.092948.38929−5.655369.17E-050.009833
miR-744−30.7536.99643−5.556790.0001080.009833
miR-671-5p−25.8625.20571−5.34930.0001520.009833
miR-502-3p−50.071469.84286−5.277750.0001710.009833
miR-1224-5p−31.517131.88429−5.205590.0001940.009833
let-7d−2246.862828.714−5.128380.0002210.009833
miR-378*−40.992956.29643−5.09170.0002350.009833
miR-501-5p−13.031414.50571−5.091330.0002350.009833
miR-501-3p−12.565713.22−6.850341.45E-050.004227
miR-23a*−11.200712.90679−6.772261.62E-050.004227
let-7c−450.336552.1893−6.442592.66E-050.004227

The list of miRNAs with differential expression in IgA nephropathy.

Discussion

Currently, anatomopathological evaluation of renal biopsies is crucial for diagnosing IgA nephropathy (Li H. et al., 2021). Nevertheless, percutaneous renal biopsies are often not carried out, and proposed histological classifications (Oxford classification system, etc.) have a few shortcomings (Kouri et al., 2021). As the undesirable clinical outcomes of patients with IgA nephropathy is in part the results of delayed diagnosis, reliable non-invasive biomarkers are urgently required, which could be applied to routine clinical practice (Moresco et al., 2015).

Through integration of three machine learning approaches (LASSO, SVM-RFE, random forest), we finally determined three characteristic RBPs of IgA nephropathy: DDX27, RCL1, and TFB2M. All of them displayed remarkable down-regulation in IgA nephropathy. DDX27 is a member of the DEAD-Box nucleic acid helicase family. Previous evidence demonstrates that DDX27 is involved in tumorigenesis. Specifically, DDX27 facilitates hepatocellular carcinoma progression via activating ERK signaling (Xiaoqian et al., 2021), and enhances stem cell-like features with undesirable survival outcome of breast cancer (Li S. et al., 2021) and colorectal cancer (Yang et al., 2019). Moreover, it heightens colony-forming capacity of gastric cancer cells and results in terrible survival outcome (Tsukamoto et al., 2015), and strengthens colorectal cancer growth and metastasis (Tang et al., 2018). Other studies demonstrate that DDX27 modulates skeletal muscle growth and regeneration through translational processes (Bennett et al., 2018), and modulates 3’ end generation of ribosomal 47S RNA and stably correlates to the PeBoW-complexing (Kellner et al., 2015). RCL1 is essential for co-transcriptional steps in 18 S rRNA biogenesis (Horn et al., 2011). Evidence suggests that RCL1 weakens hepatocellular carcinoma progression (Jiaze et al., 2022). TFB2M is a mitochondrial transcription factor (Basu et al., 2020), and its C-terminal tail is a part of the autoinhibitory mechanisms that modulate DNA binding (Basu et al., 2020). However, no studies have reported the roles of DDX27, RCL1, and TFB2M in IgA nephropathy. Our ROCs demonstrated the excellent efficacy of these characteristic genes in diagnosing IgA nephropathy.

Our GSEA results demonstrated that DDX27, RCL1, and TFB2M were significantly involved in metabolism pathways such as histidine metabolism, glyoxylate and dicarboxylate metabolism, β-alanine metabolism, glycine serine and threonine metabolism, indicating their crucial roles in RNA metabolism. Most immune cell types exhibited increased infiltration levels in IgA nephropathy, comprising CD8 T cells, follicular helper T cells, gamma delta T cells, resting NK cells, activated NK cells, M1 macrophages, M2 macrophages, resting dendritic cells, activated dendritic cells, endothelial cells, and fibroblasts, consistent with previous research (Chen et al., 2021; Tang et al., 2021). Especially, DDX27, RCL1, and TFB2M were significantly linked to most immune cell populations and immune checkpoints, indicating that above characteristic RBPs might participate in modulating immune cell infiltrations during IgA nephropathy progression. Pyroptosis has gained increasing attention due to its relationship to innate immunity and diseases (Yu et al., 2021). Among characteristic RBPs, TFB2M negatively correlated to several pyroptosis-relevant genes, indicating that TFB2M might modulate pyroptosis pathway during IgA nephropathy.

Genome-wide meta-analysis has uncovered the remarkable molecular heterogeneity across IgA nephropathy patients (Li et al., 2020). Here, based on the hub RBPs, we classified IgA nephropathy two subtypes. Most hub RBPs exhibited higher expressions in cluster 1 than 2. There was the notable heterogeneity in pyroptosis between subtypes, with higher activity of pyroptosis in cluster 1. Additionally, most metabolism pathways displayed higher activity in cluster 2 in comparison to cluster 1. We also noted that cluster 2 had lower infiltrations of memory B cells as well as higher infiltrations of plasma cells, M2 macrophages, activated mast cells and neutrophils. Meanwhile, cluster 1 was characterized by elevated expressions of IDO1, TNFRSF8, TNFRSF14, CD40, CD48, CD44 and NRP1. Altogether, the hub RBP-based subtypes exhibited widespread differences in signaling pathways, immune cell infiltrations and immune checkpoints.

Accumulated evidence shows that miRNAs exert crucial roles in the pathogenesis of IgA nephropathy (Noor et al., 2021; Pawluczyk et al., 2021; Xu et al., 2021). Our further analysis demonstrated that miR-501-3p, miR-760, miR-502-3p, miR-1224-5p, and miR-107 were potential upstream miRNAs of hub RBPs, which might post-transcriptionally regulate the expression of hub RBPs during IgA nephropathy. Despite this, the limitations of our study should be pointed out. First, although we tried our best to collect IgA nephropathy public datasets, the sample size was relatively small. Furthermore, the post-transcriptional mechanisms of hub RBPs will be experimentally verified.

Conclusion

Collectively, the present study determined three characteristic RBPs as potential diagnostic biomarkers of IgA nephropathy patients through integrating three machine learning approaches (LASSO, SVM-RFE, random forest). Additionally, we classified IgA nephropathy as two RBPs-based subtypes. Altogether, our findings provided a novel clue on the diagnosis and mechanisms of IgA nephropathy.

Statements

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.

Author contributions

CL and XS conceived and designed the study. XZ, PC, and HJ conducted most of the experiments and data analysis, and wrote the manuscript. SY, GM, and JZ participated in collecting data and helped to draft the manuscript. All authors reviewed and approved the manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.975521/full#supplementary-material

Abbreviations

IgA, immunoglobulin A; RBPs, RNA binding proteins; mRNA, messenger RNA; GEO, gene expression omnibus; sva, surrogate variable analysis; PCA, principal component analysis; miRNA, microRNA; limma, linear models for microarray data; GO, gene ontology; KEGG, kyoto encyclopedia of genes and genomes; PPI, protein-protein interaction; STRING, search tool for the retrieval of interacting genes; MCODE, molecular complex detection; LASSO, least absolute shrinkage and selection operator; SVM-RFE, support vector machine-recursive feature elimination; GSEA, gene set enrichment analysis; MSigDB, molecular signatures database; GSVA, gene set variation analysis; ssGSEA, single sample gene set enrichment analysis; ROCs, receiver operating characteristic curves.

References

  • 1

    BasuU.MishraN.FarooquiM.ShenJ.JohnsonL. C.PatelS. S. (2020). The C-terminal tails of the mitochondrial transcription factors Mtf1 and TFB2M are part of an autoinhibitory mechanism that regulates DNA binding. J. Biol. Chem.295 (20), 68236830. 10.1074/jbc.RA120.013338

  • 2

    BennettA. H.O'DonohueM. F.GundryS. R.ChanA. T.WidrickJ.DraperI.et al (2018). RNA helicase, DDX27 regulates skeletal muscle growth and regeneration by modulation of translational processes. PLoS Genet.14 (3), e1007226. 10.1371/journal.pgen.1007226

  • 3

    ChenZ.ZhangT.MaoK.ShaoX.XuY.ZhuM.et al (2021). A single-cell survey of the human glomerulonephritis. J. Cell. Mol. Med.25 (10), 46844695. 10.1111/jcmm.16407

  • 4

    GautierL.CopeL.BolstadB. M.IrizarryR. A. (2004). affy--analysis of Affymetrix GeneChip data at the probe level. Bioinformatics20 (3), 307315. 10.1093/bioinformatics/btg405

  • 5

    GoemanJ. J. (2010). L1 penalized estimation in the Cox proportional hazards model. Biom. J.52 (1), 7084. 10.1002/bimj.200900028

  • 6

    GuoF.ZhangW.SuJ.XuH.YangH. (2019). Prediction of drug positioning for quan-du-zhong capsules against hypertensive nephropathy based on the robustness of disease network. Front. Pharmacol.10, 49. 10.3389/fphar.2019.00049

  • 7

    HahnW. H.SuhJ. S.ChoS. H.ChoB. S.KimS. D. (2010). Polymorphisms of signal transducers and activators of transcription 1 and 4 (STAT1 and STAT4) contribute to progression of childhood IgA nephropathy. Cytokine50 (1), 6974. 10.1016/j.cyto.2009.12.004

  • 8

    HänzelmannS.CasteloR.GuinneyJ. (2013). Gsva: Gene set variation analysis for microarray and RNA-seq data. BMC Bioinforma.14, 7. 10.1186/1471-2105-14-7

  • 9

    HornD. M.MasonS. L.KarbsteinK. (2011). Rcl1 protein, a novel nuclease for 18 S ribosomal RNA production. J. Biol. Chem.286 (39), 3408234087. 10.1074/jbc.M111.268649

  • 10

    JiazeY.SinanH.MinjieY.YongjieZ.NanD.LiangwenW.et al (2022). Rcl1 suppresses tumor progression of hepatocellular carcinoma: A comprehensive analysis of bioinformatics and in vitro experiments. Cancer Cell. Int.22 (1), 114. 10.1186/s12935-022-02533-x

  • 11

    KellnerM.RohrmoserM.FornéI.VossK.BurgerK.MühlB.et al (2015). DEAD-box helicase DDX27 regulates 3' end formation of ribosomal 47S RNA and stably associates with the PeBoW-complex. Exp. Cell. Res.334 (1), 146159. 10.1016/j.yexcr.2015.03.017

  • 12

    KouriN. M.StangouM.LiouliosG.MitsoglouZ.SerinoG.ChiurliaS.et al (2021). Serum levels of miR-148b and let-7b at diagnosis may have important impact in the response to treatment and long-term outcome in IgA nephropathy. J. Clin. Med.10 (9), 1987. 10.3390/jcm10091987

  • 13

    LeekJ. T.JohnsonW. E.ParkerH. S.JaffeA. E.StoreyJ. D. (2012). The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics28 (6), 882883. 10.1093/bioinformatics/bts034

  • 14

    LiH.ChenZ.ChenW.LiJ.LiuY.MaH.et al (2021). MicroRNA-23b-3p deletion induces an IgA nephropathy-like disease associated with dysregulated mucosal IgA synthesis. J. Am. Soc. Nephrol.32 (10), 25612578. 10.1681/asn.2021010133

  • 15

    LiM.WangL.ShiD. C.FooJ. N.ZhongZ.KhorC. C.et al (2020). Genome-wide meta-analysis identifies three novel susceptibility loci and reveals ethnic heterogeneity of genetic susceptibility for IgA nephropathy. J. Am. Soc. Nephrol.31 (12), 29492963. 10.1681/asn.2019080799

  • 16

    LiS.MaJ.ZhengA.SongX.ChenS.JinF. (2021). DEAD-box helicase 27 enhances stem cell-like properties with poor prognosis in breast cancer. J. Transl. Med.19 (1), 334. 10.1186/s12967-021-03011-0

  • 17

    LiY.XiaM.PengL.LiuH.ChenG.WangC.et al (2021). Downregulation of miR-214-3p attenuates mesangial hypercellularity by targeting PTEN-mediated JNK/c-Jun signaling in IgA nephropathy. Int. J. Biol. Sci.17 (13), 33433355. 10.7150/ijbs.61274

  • 18

    LiberzonA.BirgerC.ThorvaldsdóttirH.GhandiM.MesirovJ. P.TamayoP. (2015). The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell. Syst.1 (6), 417425. 10.1016/j.cels.2015.12.004

  • 19

    LiuP.LassénE.NairV.BerthierC. C.SuguroM.SihlbomC.et al (2017). Transcriptomic and proteomic profiling provides insight into mesangial cell function in IgA nephropathy. J. Am. Soc. Nephrol.28 (10), 29612972. 10.1681/asn.2016101103

  • 20

    MoldoveanuZ.SuzukiH.ReilyC.SatakeK.NovakL.XuN.et al (2021). Experimental evidence of pathogenic role of IgG autoantibodies in IgA nephropathy. J. Autoimmun.118, 102593. 10.1016/j.jaut.2021.102593

  • 21

    MorescoR. N.SpeeckaertM. M.DelangheJ. R. (2015). Diagnosis and monitoring of IgA nephropathy: The role of biomarkers as an alternative to renal biopsy. Autoimmun. Rev.14 (10), 847853. 10.1016/j.autrev.2015.05.009

  • 22

    NagasawaY.OkuzakiD.MusoE.YamamotoR.ShinzawaM.IwasakiY.et al (2016). IFI27 is a useful genetic marker for diagnosis of immunoglobulin A nephropathy and membranous nephropathy using peripheral blood. PLoS One11 (4), e0153252. 10.1371/journal.pone.0153252

  • 23

    NewmanA. M.LiuC. L.GreenM. R.GentlesA. J.FengW.XuY.et al (2015). Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods12 (5), 453457. 10.1038/nmeth.3337

  • 24

    NiuX.ChenL.LiY.HuZ.HeF. (2022). Ferroptosis, necroptosis, and pyroptosis in the tumor microenvironment: Perspectives for immunotherapy of SCLC. Semin. Cancer Biol.12. 10.1016/j.semcancer.2022.03.009

  • 25

    NoorF.SaleemM. H.AslamM. F.AhmadA.AslamS. (2021). Construction of miRNA-mRNA network for the identification of key biological markers and their associated pathways in IgA nephropathy by employing the integrated bioinformatics analysis. Saudi J. Biol. Sci.28 (9), 49384945. 10.1016/j.sjbs.2021.06.079

  • 26

    PawluczykI. Z. A.DidangelosA.BarbourS. J.ErL.BeckerJ. U.MartinR.et al (2021). Differential expression of microRNA miR-150-5p in IgA nephropathy as a potential mediator and marker of disease progression. Kidney Int.99 (5), 11271139. 10.1016/j.kint.2020.12.028

  • 27

    RitchieM. E.PhipsonB.WuD.HuY.LawC. W.ShiW.et al (2015). Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res.43 (7), e47. 10.1093/nar/gkv007

  • 28

    SerinoG.SallustioF.CoxS. N.PesceF.SchenaF. P. (2012). Abnormal miR-148b expression promotes aberrant glycosylation of IgA1 in IgA nephropathy. J. Am. Soc. Nephrol.23 (5), 814824. 10.1681/asn.2011060567

  • 29

    ShannonP.MarkielA.OzierO.BaligaN. S.WangJ. T.RamageD.et al (2003). Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res.13 (11), 24982504. 10.1101/gr.1239303

  • 30

    SternburgE. L.KarginovF. V. (2020). Global approaches in studying RNA-binding protein interaction networks. Trends biochem. Sci.45 (7), 593603. 10.1016/j.tibs.2020.03.005

  • 31

    SubramanianA.TamayoP.MoothaV. K.MukherjeeS.EbertB. L.GilletteM. A.et al (2005). Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. U. S. A.102 (43), 1554515550. 10.1073/pnas.0506580102

  • 32

    SuzukiH.NovakJ. (2021). IgA glycosylation and immune complex formation in IgAN. Semin. Immunopathol.43 (5), 669678. 10.1007/s00281-021-00883-8

  • 33

    TangJ.ChenH.WongC. C.LiuD.LiT.WangX.et al (2018). DEAD-box helicase 27 promotes colorectal cancer growth and metastasis and predicts poor survival in CRC patients. Oncogene37 (22), 30063021. 10.1038/s41388-018-0196-1

  • 34

    TangR.MengT.LinW.ShenC.OoiJ. D.EggenhuizenP. J.et al (2021). A partial picture of the single-cell transcriptomics of human IgA nephropathy. Front. Immunol.12, 645988. 10.3389/fimmu.2021.645988

  • 35

    TsukamotoY.FumotoS.NoguchiT.YanagiharaK.HirashitaY.NakadaC.et al (2015). Expression of DDX27 contributes to colony-forming ability of gastric cancer cells and correlates with poor prognosis in gastric cancer. Am. J. Cancer Res.5 (10), 29983014.

  • 36

    WilkersonM. D.HayesD. N. (2010). ConsensusClusterPlus: A class discovery tool with confidence assessments and item tracking. Bioinformatics26 (12), 15721573. 10.1093/bioinformatics/btq170

  • 37

    WuC. Y.HuaK. F.HsuW. H.SuzukiY.ChuL. J.LeeY. C.et al (2020). IgA nephropathy benefits from compound K treatment by inhibiting NF-κB/NLRP3 inflammasome and enhancing autophagy and SIRT1. J. Immunol.205 (1), 202212. 10.4049/jimmunol.1900284

  • 38

    WuX.XuL. (2022). The RNA-binding protein HuR in human cancer: A friend or foe?Adv. Drug Deliv. Rev.184, 114179. 10.1016/j.addr.2022.114179

  • 39

    XiaoqianW.BingZ.YangweiL.YafeiZ.TingtingZ.YiW.et al (2021). DEAD-Box helicase 27 promotes hepatocellular carcinoma progression through ERK signaling. Technol. Cancer Res. Treat.20, 15330338211055953. 10.1177/15330338211055953

  • 40

    XieX.LiuP.GaoL.ZhangX.LanP.BijolV.et al (2021). Renal deposition and clearance of recombinant poly-IgA complexes in a model of IgA nephropathy. J. Pathol.254 (2), 159172. 10.1002/path.5658

  • 41

    XuY.HeY.HuH.XuR.LiaoY.DongX.et al (2021). The increased miRNA-150-5p expression of the tonsil tissue in patients with IgA nephropathy may be related to the pathogenesis of disease. Int. Immunopharmacol.100, 108124. 10.1016/j.intimp.2021.108124

  • 42

    YangC.LiD.BaiY.SongS.YanP.WuR.et al (2019). DEAD-box helicase 27 plays a tumor-promoter role by regulating the stem cell-like activity of human colorectal cancer cells. Onco. Targets. Ther.12, 233241. 10.2147/ott.S190814

  • 43

    YuG.WangL. G.HanY.HeQ. Y. (2012). clusterProfiler: an R package for comparing biological themes among gene clusters. Omics16 (5), 284287. 10.1089/omi.2011.0118

  • 44

    YuP.ZhangX.LiuN.TangL.PengC.ChenX. (2021). Pyroptosis: Mechanisms and diseases. Signal Transduct. Target. Ther.6 (1), 128. 10.1038/s41392-021-00507-5

  • 45

    ZengH.WangL.LiJ.LuoS.HanQ.SuF.et al (2021). Single-cell RNA-sequencing reveals distinct immune cell subsets and signaling pathways in IgA nephropathy. Cell. Biosci.11 (1), 203. 10.1186/s13578-021-00706-1

  • 46

    ZhengM.KannegantiT. D. (2020). The regulation of the ZBP1-NLRP3 inflammasome and its implications in pyroptosis, apoptosis, and necroptosis (PANoptosis). Immunol. Rev.297 (1), 2638. 10.1111/imr.12909

Summary

Keywords

IgA nephropathy, RNA binding proteins, machine learning, diagnosis, subtypes, immunity, pyroptosis

Citation

Zhang X, Chao P, Jiang H, Yang S, Muhetaer G, Zhang J, Song X and Lu C (2022) Integration of three machine learning algorithms identifies characteristic RNA binding proteins linked with diagnosis, immunity and pyroptosis of IgA nephropathy. Front. Genet. 13:975521. doi: 10.3389/fgene.2022.975521

Received

22 June 2022

Accepted

12 September 2022

Published

28 September 2022

Volume

13 - 2022

Edited by

Muhammad Hamzah Saleem, University of Chinese Academy of Sciences, China

Reviewed by

Muhammad Ahsan Naeem, University of Veterinary and Animal Sciences, Pakistan

Aftab Shaukat, Huazhong Agricultural University, China

Sidra Aslam, Government College University, Faisalabad, Pakistan

Updates

Copyright

*Correspondence: Xue Song, ; Chen Lu,

†These authors have contributed equally to this work and share first authorship

This article was submitted to RNA, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics