Skip to main content

ORIGINAL RESEARCH article

Front. Immunol., 30 August 2024
Sec. Inflammation

Identification of immune-related biomarkers for intracerebral hemorrhage diagnosis based on RNA sequencing and machine learning

  • 1Department of Clinical Laboratory Medicine, Xijing Hospital, Fourth Military Medical University, Xi’an, China
  • 2State Key Laboratory of Cardiovascular Disease, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
  • 3Neurosurgery, Affiliated Hospital of Hebei University, Baoding, Hebei, China

Background: Intracerebral hemorrhage (ICH) is a severe stroke subtype with high morbidity, disability, and mortality rates. Currently, no biomarkers for ICH are available for use in clinical practice. We aimed to explore the roles of RNAs in ICH pathogenesis and identify potential diagnostic biomarkers.

Methods: We collected 233 individual blood samples from two independent cohorts, including 64 patients with ICH, 59 patients with ischemic stroke (IS), 60 patients with hypertension (HTN) and 50 healthy controls (CTRL) for RNA sequencing. Differentially expressed genes (DEGs) analysis, gene set enrichment analysis (GSEA), and weighted correlation network analysis (WGCNA) were performed to identify ICH-specific modules. The immune cell composition was evaluated with ImmuneCellAI. Multiple machine learning algorithms to select potential biomarkers for ICH diagnosis, and further validated by quantitative real-time polymerase chain reaction (RT−PCR). Receiver operating characteristic (ROC) curve analysis and decision curve analysis (DCA) were performed to evaluate the diagnostic value of the signature for ICH. Finally, we generated M1 and M2 macrophages to investigate the expression of candidate genes.

Results: In both cohorts, 519 mRNAs and 131 lncRNAs were consistently significantly differentially expressed between ICH patients and HTN controls. Gene function analysis suggested that immune system processes may be involved in ICH pathology. ImmuneCellAI analysis revealed that the abundances of 11 immune cell types were altered after ICH in both cohorts. WGCNA and GSEA identified 18 immune-related DEGs. Multiple algorithms identified an RNA panel (CKAP4, BCL6, TLR8) with high diagnostic value for discriminating ICH patients from HTN controls, CTRLs and IS patients (AUCs: 0.93, 0.95 and 0.82; sensitivities: 81.3%, 84.4% and 75%; specificities: 100%, 96% and 79.7%, respectively). Additionally, CKAP4 and TLR8 mRNA and protein levels decreased in RAW264.7 M1 macrophages and increased in RAW264.7 M2 macrophages, while BCL6 expression increased in M1 macrophages but not in M2 macrophages, which may provide potential therapeutic targets for ICH.

Conclusions: This study demonstrated that the expression levels of lncRNAs and mRNAs are associated with ICH, and an RNA panel (CKAP4, BCL6, TLR8) was developed as a potential diagnostic tool for distinguishing ICH from IS and controls, which could provide useful insight into ICH diagnosis and pathogenesis.

Introduction

Stroke remains the second leading cause of death worldwide and has been regarded as a global health burden at both the individual and societal levels (1). The estimated number of incident strokes was 13.7 million in 2016, approximately 87% of which were ischemic stroke (IS) (2). In the same year, IS and intracerebral hemorrhage (ICH) accounted for 2.7 million and 2.8 million deaths, respectively (3). ICH accounts for approximately 23.8% of strokes in China, with a mortality rate of 67.9%, which is higher than that of IS (4). Hypertension (HTN) is the most common risk factor for stroke (5, 6), accounting for approximately 65% of all ICHs (7, 8). Although effective HTN management has reduced the incidence of ICH in some high-income countries, the incidence and prevalence of ICH have increased in China (5), and HTN remains the greatest risk factor for ICH. Currently, the diagnosis of stroke depends on neuroimaging techniques, and clinicians often face diagnostic challenges in distinguishing between ICH and acute IS because clinical findings can be vague and neuroimaging (especially magnetic resonance imaging) is difficult. Early identification of patients with acute IS is essential because reperfusion therapy can be administered soon after stroke onset, which is very important for achieving recovery and a good prognosis in IS patients (9). In recent years, the detection of biomarkers has become important for assisting in the early diagnosis of stroke. However, none of these methods has proven to be completely reliable or has become a clinical standard. The currently used stroke biomarkers are limited by insufficient specificity, difficulties related to detection and acquisition, a detection time beyond the thrombolysis window, and establishment on the basis of a small sample size (10, 11). Thus, identifying potential diagnostic biomarkers and understanding the pathophysiological mechanisms underlying the development of ICH are essential.

Blood samples are easily accessible and acceptable for patients, which makes them attractive for biomarker discovery and validation (12). Emerging evidence has revealed that peripheral blood cells play vital roles in the neurological injury caused by ICH and that global messenger RNA (mRNA) and noncoding RNA (ncRNA) expression profiles are altered rapidly in the blood after ICH (1315). ncRNAs are involved in various biological processes associated with stroke (16, 17) and are potential biomarkers and therapeutic tools. ncRNAs include long noncoding RNAs (lncRNAs), circular RNAs (circRNAs) and microRNAs (miRNAs), which are involved in the regulation of transcription and translation (18). Our previous studies focused on identifying circRNAs as potential biomarkers for ICH diagnosis (19, 20). In this study, we aimed to investigate the expression profiles of lncRNAs and mRNAs by using RNA sequencing data from two independent cohorts and investigated the potential functions of the identified RNAs via gene ontology (GO) and pathway analyses. We aimed to explore RNA expression profiles and functions to identify specific mRNAs and lncRNAs as potential biomarkers for the diagnosis of ICH, which might provide useful insight into the pathogenesis of ICH and a more effective diagnostic tool.

Materials and methods

Study subjects

The participants enrolled in this study were described in our previous study (19, 20). The study protocol was reviewed and approved by the Human Ethics Committee, Fuwai Hospital (Approval No. 2016-732), and the study was conducted in accordance with the principles of Good Clinical Practice and the Declaration of Helsinki. Written informed consent was obtained from all study participants or their legal proxies.

In brief, a total of 273 individuals, including individuals with ICH (n=84), IS (n = 59), or HTN (n= 60) and age-matched healthy controls (CTRLs, n=70), were recruited from three individual cohorts between 2014 and 2024. In the first cohort, 160 individuals (44 ICH patients, 43 IS patients, 42 HTN controls and 31 CTRLs) from Cangzhou Central Hospital who were enrolled between 2014 and 2017 composed the discovery phase. In the second cohort, 20 consecutive ICH patients from Hebei Baoding Hospital, another 18 HTN controls and 16 IS patients from the General Hospital of Ningxia Medical University, and 19 CTRLs from Tsinghua University Hospital were enrolled between 2017 and 2019. Additionally, 20 ICH patients and 20 CTRLs from Xijing Hospital were enrolled as an independent validation cohort between 2023 and 2024. ICH and IS were diagnosed by professional neurologists on the basis of medical history and exams and confirmed by computed tomography (CT) or magnetic resonance imaging (MRI) (21). HTN controls with simple HTN but without a history of previous stroke or cardiovascular events were selected as the HTN control group, and the CTRLs group was matched for age, sex, and vascular risk factors, including diabetes mellitus, hyperlipidemia, and smoking and drinking status. Information on demographic and clinical characteristics was obtained through face−to-face surveys and by checking hospital records or medical examination records (Tables 1, 2). The exclusion criteria included autoimmune diseases, cardiac disease, liver diseases, renal diseases, cancer, and a history of previous stroke or IS with hemorrhagic transformation.

Table 1
www.frontiersin.org

Table 1. Demographics and characteristics of the discovery cohorts.

Table 2
www.frontiersin.org

Table 2. Demographics and characteristics of the validation cohorts.

RNA sequencing and data analysis

Blood samples from patients with spontaneous ICH within 48 hours of admission or acute IS patients were collected for transcriptome analysis. RNA isolation and sequencing were performed as previously described (19, 20). Library construction and sequencing were performed by Annoroad Gene Technology (Beijing, China) via Illumina’s NEBNext Ultra Directional RNA Library Prep Kit (NEB, Ipswich, USA). Clustering of the index-coded samples was performed on a cBot 2 cluster generation system via the TruSeq PE Cluster Kit v4-cBot-HS (Illumina, CA, USA) according to the manufacturer’s instructions. After cluster generation, the libraries were sequenced on an Illumina HiSeq 2500 platform for 150 bp paired-end reads. All reads were mapped to the human genome hg19 via the STAR2.4.1d aligner. The DESeq2 (22) and edgeR (23) packages were used to normalize the FPKM values and identify significant differentially expressed RNAs. Significantly differentially expressed genes (DEGs) between the two groups were identified as those with a |fold change| ≥ 2 and an adjusted p value (FDR) < 0.05. P values were corrected for multiple testing with the Benjamini–Hochberg method. Hierarchical clustering was performed, and heatmaps were generated on the basis of the normalized values of all DEGs using the R package. Venn diagrams and volcano plots were generated to visualize the consistently significant DEGs between the two cohorts. The RNA-seq data have been deposited into the Genome Sequence Archive in the National Genomics Data Center, China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences under accession number HRA001807, and they are publicly accessible at https://ngdc.cncb.ac.cn/gsa-human (19).

DEG functional enrichment

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses were performed to annotate the potential functions of the DEGs. A false discovery rate (FDR) < 0.05 was set as the cutoff for significantly enriched GO terms and KEGG pathways. Furthermore, gene set enrichment analysis (GSEA) was used to analyze the common DEGs in the two cohorts via the clusterProfiler R package, as previously described (24). The gene sets were analyzed on the basis of the KEGG and Reactome pathways. Enriched gene sets were assigned on the basis of a nominal p value<0.05 and a FDR <0.25.

Construction of the lncRNA−mRNA coexpression network

To explore the correlation of differentially expressed lncRNAs and differentially expressed mRNAs, we constructed a lncRNA−mRNA coexpression network via Pearson correlation coefficient analysis. Coexpressed lncRNA−mRNA pairs were defined as those with a rho value > 0.85 and a p value <0.05. A correlation network was constructed via the OmicStudio tools at https://www.omicstudio.cn/tool.

Weighted gene correlation network analysis

A co−expression network was constructed with the WGCNA package to identify the correlations among genes and identify highly correlated gene modules and potential biomarkers. The ICH, IS, HTN and CTRL group data were analyzed via the R package WGCNA (25). Pearson’s correlation analysis was performed to identify coexpressed genes, and an adjacency matrix was constructed on the basis of soft-thresholding (β = 9). Then, we created a topological overlap matrix (TOM) to visualize the connections among genes. Modules were identified via hierarchical clustering via the TOM and the dynamic tree cut algorithm. A gene significance > 0.2 and a module membership > 0.6 were calculated for individual modules to determine the most important genes. R > 0.5 and P < 0.05 were considered to indicate that a module was significant and should be selected for additional processing.

Evaluation of immune cell composition via ImmuneCellAI

We evaluated the immune cell composition of the two comparison groups in the two cohorts via the ImmuneCellAI website (https://guolab.wchscu.cn/ImmuCellAI/), as previously described (26). The abundances of 24 immune cell types and 18 T-cell subtypes were calculated, and the abundance of immune cells in each sample was determined for further comparison. P<0.05 was considered to indicate statistical significance.

Identification of candidate immune-related biomarkers with multiple machine learning algorithms

To identify immune-related biomarkers for ICH, we used least absolute shrinkage and selection operator (LASSO), support vector machine recursive feature elimination (SVM-RFE) algorithms, XGBoost-RET and Boruta algorithms to rank the importance of features according to RNA expression levels in all samples. The intersection of the candidate biomarkers was used to further assess their diagnostic value in discriminating ICH patients from patients in other groups according to the expression levels of RNAs via eight machine learning classification algorithms, namely, support vector machine (SVM), K-nearest neighbor (KNN), logistic regression (LR), random forest (RF), Gaussian naive Bayes (GNB), AdaBoost, light gradient boosting machine (LGBM) and eXtreme gradient boosting (XGB). The area under the curve (AUC), sensitivity, specificity, accuracy, positive predictive value (PPV), and negative predictive value (NPV) were computed. The programs used to run the algorithms and the specific protocols or tools used to assess diagnostic value on the Beckman Coulter DxAI platform (https://www.xsmartanalysis.com/beckman/login/).

Validation via real-time polymerase chain reaction

Candidate biomarkers (CKAP4, BCL6 and TLR8) were selected for validation via quantitative real-time polymerase chain reaction (RT−PCR). Total RNA from peripheral blood white blood cells was isolated via TRIzol reagent (Invitrogen). cDNA synthesis was completed via the use of 1 µg of total RNA and a Transcriptor First Stand cDNA Synthesis Kit (Takara, Dalian, China). RT−PCR was performed via SYBR Master Mix (Yeasen, Shanghai, China) according to the manufacturer’s instructions. The RNA primers were designed via the NCBI Primer-BLAST website. The primers used in this study are listed in Supplementary Table 9. The target gene mRNA levels were quantified via normalization to those of the standard housekeeping gene gapdh, which served as an internal control.

Cell culture

RAW264.7 cells were obtained from the Cell Resource Center, Peking Union Medical College (which is part of the National Science and Technology Infrastructure, the National Biomedical Cell-Line Resource, NSTI-BMCR; http://cellresource.cn). The cells were cultured in DMEM supplemented with 10% fetal bovine serum, 100 U/mL penicillin, and 100 mg/mL streptomycin in a 5% CO2 incubator at 37°C. Cells obtained from passages three to six were used in the study. For macrophage polarization, lipopolysaccharide (LPS, 100 ng/ml; Sigma−Aldrich; St. Louis, MO) and interferon-γ (IFN-γ, 20 ng/ml; PeproTech; Rocky Hill, NJ) were added to the culture mixture for 24 h to induce M1 polarization, and interleukin-4 (IL-4, 20 ng/ml; PeproTech; Rocky Hill, NJ) was added to the culture mixture for 24 h to induce M2 polarization.

Western blot analysis

Cellular protein was extracted via RIPA lysis buffer containing a protease inhibitor cocktail (Roche). After homogenization on ice and centrifugation, total protein was mixed with loading buffer and separated on 4–12% NuPAGE Bis−Tris gels (Invitrogen) before being transferred to a nitrocellulose membrane. After being blocked with 5% nonfat milk containing Tris-buffered saline, the membranes were blotted with primary antibodies overnight at 4°C. The primary antibodies used were as follows: a monoclonal antibody against BCL6 (1:500, ab241549; Abcam, Cambridge, MA, USA), a polyclonal antibody against TLR8 (1:1000, ab8245; Abcam, Cambridge, MA, USA), a polyclonal antibody against CKAP4 (1:2000, 16686-1-AP; Proteintech, Wuhan, China), and a monoclonal antibody against GAPDH (1:10000, ab8245; Abcam, Cambridge, MA, USA). The membranes were subsequently washed four times for 5 min each with TBST buffer and incubated with a horseradish peroxide-conjugated secondary antibody (1:5000, SA00001-1, SA00001-2, Proteintech, Wuhan, China) at room temperature for 2 h. After washing, the membrane was developed with enhanced chemiluminescence reagent (Invitrogen, USA), and a ChemiDoc MP Imaging System (Bio-Rad, USA) was used for signal detection. Protein expression was quantified and normalized to that of GAPDH, which was used as an internal control. The signal density was quantified via ImageJ software (version 1.52a, NIH, USA).

Histology and immunofluorescence

Spontaneous ICH mouse models were induced and assessed via MRI as described previously (27, 28). Animal experiments were approved by the Committee on the Ethics of Animal Experiments at Fuwai Hospital (approval No.: 0085-M-200-HX) and complied with the National Institutes of Health (NIH)’s Guide for the Care and Use of Laboratory Animals. The manuscript adheres to Animal Research: Reporting of In Vivo Experiments (ARRIVE) guidelines for reporting animal experiments. The brains were harvested and fixed in 4% paraformaldehyde for 6 h and then cryoprotected in 20% sucrose overnight at 4°C. Frozen coronal sections were cut at a thickness of 20 μm. Hematoxylin−eosin (HE) staining was applied to observe hemorrhages. For immunofluorescence staining, frozen brain sections were incubated in goat serum (ZSGB-BIO, Beijing, China) with 0.3% Triton X-100 for blocking. For BCL6, after above incubation, sections were blocked with Mouse on Mouse (MOM) Blocking Reagent (Vector Laboratories, MKB-2213-1, Burlingame, CA, USA) for one hour at room temperature to reduce non-specific binding, followed by incubation with primary antibodies overnight at 4°C. The primary antibodies used were as follows: mouse anti-BCL6 (1:100, ab241549; Abcam, Cambridge, MA, USA), rabbit anti-TLR8 (1:100, ab8245; Abcam, Cambridge, MA, USA), and rabbit anti-CKAP4 (1:100, 16686-1-AP; Proteintech, Wuhan, China). Rabbit or mouse isotype antibodies were used as negative controls. After several washes, the sections were incubated with secondary antibodies (Alexa Fluor 594-conjugated goat anti-mouse IgG, Alexa Fluor 488-conjugated goat anti-rabbit IgG; 1:300, ZSGB-BIO, Beijing, China) at room temperature for 30 min. Coverslips were mounted with Vecta Shield medium containing DAPI to stain the nuclei. All images were visualized via an FV3000 laser scanning confocal microscope.

Statistical analysis

Statistical analysis was performed via SPSS 21.0 (IBM Corp., NY, USA). The sample distribution was determined via the Kolmogorov–Smirnov normality test. For parametric data, two-tailed unpaired Student’s t tests were used to evaluate differences between two groups. One-way ANOVA and the Bonferroni post hoc correction were performed when more than two groups were evaluated. The data are presented as the means ± standard deviations or medians (interquartile ranges). Statistical comparisons of percentages were performed via chi-square tests. For RNA sequencing analysis, DEGs were selected if significant differences (fold change ≥ 2 and FDR < 0.05) between two groups were observed via an unpaired Mann−Whitney test. Spearman’s correlation analysis was performed to investigate the correlations between ICH risk factors and candidate RNAs. Receiver operating characteristic (ROC) curve analysis and decision curve analysis (DCA) were used as accuracy indices for evaluating the diagnostic performance of the selected RNA panel. P < 0.05 was considered to indicate statistical significance.

Results

Characteristics and demographics of the study population

To investigate the expression profiles of mRNAs and lncRNAs associated with the occurrence and development of ICH, we performed RNA sequencing (RNA-seq) of the discovery and validation cohorts. The average age of the 233 subjects in this study was 57.58 ± 6.99 years (± SD), and 119 subjects (51.07%) were male. The demographics and characteristics of the ICH patients, IS patients, HTN controls and CTRLs in the discovery (n = 160) and validation (n = 73) cohorts are shown in Tables 1, 2. No significant differences in sex, age or incidence rates of diabetes mellitus or hyperlipidemia existed among the patients with ICH, IS, or HTN and the matched controls in either the discovery or validation cohorts. The workflow of this study is shown in Supplementary Figure 1.

LncRNA and mRNA expression profiles are significantly altered in ICH patients in both the discovery and validation cohorts

In total, 519 mRNAs and 131 lncRNAs (fold change > 2 and FDR < 0.05) were consistently significantly differentially expressed between ICH patients and HTN controls (Figures 1A-F; Supplementary Tables 1, 2) in both cohorts according to the DESeq2 and EdgeR results. Volcano plots were generated to evaluate the variation and reproducibility of lncRNA and mRNA expression in ICH patients and HTN controls in the discovery (Figures 1A, B) and validation (Figures 1D, E) cohorts. The expression patterns of mRNAs and lncRNAs in ICH patients and HTN controls in the discovery (Figure 1G) and validation (Figure 1H) cohorts were distinguished via hierarchical clustering and heatmaps. Similarly, 751 mRNAs and 166 lncRNAs were consistently significantly differentially expressed between ICH patients and CTRLs (Supplementary Figures 2A, B; Supplementary Tables 3, 4), and 207 mRNAs and 45 lncRNAs were consistently significantly differentially expressed between ICH patients and IS patients according to the same methods (Supplementary Figures 2C, D; Supplementary Tables 5, 6). Moreover, we identified 157 differentially expressed mRNAs and 41 differentially expressed lncRNAs that overlapped among the three comparison groups (ICH vs. HTN, ICH vs. CTRL and ICH vs. IS) and selected them for further analysis (Figures 1I, J).

Figure 1
www.frontiersin.org

Figure 1. Differentially expressed genes between intracerebral hemorrhage (ICH) patients and hypertension (HTN) controls in the discovery and validation cohorts. Volcano plots of the mRNA (A, D) and lncRNA (B, E) expression profiles of ICH patients and HTN controls (fold change ≥2 and FDR < 0.05) in the discovery (A, B) and validation (D, E) cohorts. The red dots represent upregulated genes, and the blue dots represent downregulated genes. (C, F) Venn diagram showing the consistently altered mRNAs (C) and lncRNAs (F) in ICH patients compared with HTN controls in the discovery and validation cohorts according to both the DESeq2 and edgeR methods. (G, H) Hierarchical clustering of genes that were consistently differentially expressed between ICH patients and HTN controls in the discovery (G) and validation (H) cohorts. Blue represents downregulated genes, red represents upregulated genes, and gray represents genes whose expression did not change. Each column represents a sample, and each row represents a single gene. (I, J) Venn diagram showing the consistently differentially expressed mRNAs (I) and lncRNAs (J) between the comparison groups (ICH vs. HTN, ICH vs. CTRL and ICH vs. IS) in both cohorts. (K) The top 20 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways associated with consistently differentially expressed mRNAs. Statistical significance levels were corrected for multiple testing using the Benjamini–Hochberg procedure.

Construction of the lncRNA−mRNA coexpression network

Genes with the same expression pattern may function together. To explore the relationships between the 157 differentially expressed mRNAs and 41 differentially expressed lncRNAs in all the samples, we constructed a lncRNA−mRNA coexpression network via Pearson correlation coefficient analysis. We defined lncRNA−mRNA pairs as coexpressed if the absolute value of rho was >0.85 and the p value was < 0.05. The lncRNA−mRNA coexpression network contained 49 nodes (36 mRNAs and 13 lncRNAs) and 52 connections. The top 5 nodes were RP11-574K11.5, CTB-61M7.2, RP11-483F11.7, AC098823.3 and LINC00671, with a degree >5 (Supplementary Figure 3). The top 10 lncRNA−mRNA coexpression pairs were RP11−574K11.5− S100A12, RP11−574K11.5− GPR84, RP11−483F11.7− ENTPD7, RP11−36B6.1− AP3B2, LINC00671− MMP9, RP11−574K11.5− HP, RP11−483F11.7− UGCG, CTB−61M7.2− SLC2A3, RP11−574K11.5− C19orf59, and RP11−574K11.5− SLC51A (Supplementary Table 7), which may play vital roles in the regulation of ICH pathogenesis.

Functional enrichment and pathway analysis of DEGs

Functional enrichment analyses were performed to further explore the functions of the DEGs between patients with ICH and matched controls. We observed that the enriched GO terms for the significant DEGs included immune response, immune system process, regulation of biological response, and receptor binding (Supplementary Figure 4), which are closely related to the pathophysiology of ICH. KEGG pathway analysis of the DEGs revealed that the cytokine−cytokine receptor interaction, MAPK signaling pathway, PI3K–Akt signaling pathway, ECM–receptor interaction, Notch signaling pathway, B-cell receptor signaling pathway, Th1 and Th2 cell differentiation, TNF signaling pathway and Th17 cell differentiation pathways were significantly enriched in both cohorts (Figure 1K). Gene set enrichment analysis (GSEA) was performed to identify the signaling pathways involved. Reactome enrichment revealed that the top pathways were the immune system (NES=1.89, P= 0.0034) (Figure 2A), metabolism of lipids, B-cell receptor signaling pathways, HIF-1 signaling pathways, Fc gamma R-mediated phagocytosis, osteoclast differentiation, and Th1 and Th2 cell differentiation (Figures 2B-D; Supplementary Figures 5A-D). These results demonstrated that the immune response may play important roles in the pathogenesis of ICH.

Figure 2
www.frontiersin.org

Figure 2. Gene set enrichment analysis (GSEA) of representative pathways associated with genes significantly differentially expressed in ICH patients. (A) Enrichment plot showing the immune system pathway with the highest enrichment score. (B-D) Ridge plot (B), network diagram (C) and UpSet plot (D) showing the top 10 pathways in the annotated gene sets. Statistical significance levels were corrected for multiple testing using the Benjamini–Hochberg procedure.

Immune cell abundance is significantly altered in ICH patients according to immune cell infiltration analysis

To explore the immune cell composition after ICH, we used ImmuneCellAI to estimate the abundances of 24 immune cell types in the two cohorts. The abundances of 24 immune cells in each sample in the discovery and validation cohort are shown in Figures 3A, B. Comparisons of the proportions of 24 immune cell types between ICH patients and HTN controls in the discovery and validation cohorts are shown in Figures 3C, D. Overall, 11 immune cell types overlapped between the two cohorts, as shown in the boxplot in Figure 3E. The results revealed that the numbers of CD4 T cells, CD8 T cells, type 1 regulatory T (Tr1) cells, induced regulatory T (iTreg) cells, follicular T-helper (Tfh) cells, gamma-delta T cells, Th17 cells, and other immune cells, including B cells, monocytes, natural killer (NK) cells and neutrophils, significantly differed between ICH patients and HTN controls (P<0.05). Moreover, monocyte, neutrophil and Th17 cell numbers increased, but the numbers of other cell types decreased significantly after ICH. These results indicate that the proportion and distribution of immune cells are altered in ICH patients and may play crucial roles in the pathogenesis of ICH.

Figure 3
www.frontiersin.org

Figure 3. Immune cell infiltration analysis of the intracerebral hemorrhage (ICH) and hypertension (HTN) control groups by ImmuneCellAI. (A, B) Stacked bar plots showing the relative proportions of 24 immune cell subsets in the discovery (A) and validation (B) cohorts. (C, D) Violin diagrams showing the differences between ICH patients and HTN controls for 24 infiltrating immune cell types in the discovery (C) and validation (D) cohorts. (E) Violin diagrams showing 11 consistent differences in infiltrating immune cell types between ICH patients and HTN controls in both cohorts. The data were assessed using the Benjamini−Hochberg (BH) method.

Identification of immune-related DEGs via weighted gene coexpression network analysis and GSEA

To identify the specific modules associated with ICH, we performed WGCNA with the R package and performed network and module detection. Then, gene cluster dendrograms were constructed, and dynamic tree cutting was performed (Figure 4A). Fourteen modules of coexpressed genes were identified, and the correlations among different modules and differential groups were assessed to determine the most significant correlations. Notably, the dark gray (R = 0.52, P =8e-13) and black (R = -0.56, P =4e-15) modules were significantly different between the ICH group and the other groups on the basis of the criteria of absolute correlation > 0.5 and p value < 0.05 (Figure 4B). A total of 2029 key module genes related to ICH were identified. According to previous GSEA and immune cell infiltration analysis results, the immune system plays a vital role in the pathogenesis of ICH; subsequently, we identified 18 intersecting genes by overlapping 2029 key module genes related to ICH, 721 key genes related to the immune system and 367 DEGs (ICH vs. HTN and ICH vs. CTRL); these genes were identified as immune-related DEGs in ICH (Figure 4C). Furthermore, we analyzed the correlations between the expression levels of eighteen immune-related DEGs and immune cell infiltration in ICH patients via Spearman’s analysis. The results revealed that the expression levels of most immune-related DEGs were positively correlated with the abundance of Th17 cells, macrophages, monocytes and neutrophils and negatively correlated with Tfh cells, iTreg cells, Tr1 cells, CD4-naive cells, nTreg cells, CD8 T cells, MAIT cells, NK cells, CD4-T cells, and exhausted and gamma delta T cells; however, QPCTL exhibited the opposite pattern (P<0.05) (Figures 4D, E). Additionally, univariate Cox regression was performed to evaluate these eighteen immune-related DEGs as predictors of ICH prognosis, and the results revealed that the expression levels of IL18R1 (HR=2.82, 95% CI: 1.26–6.31, P=0.012), CKAP4 (HR=1.48, 95% CI: 1.07–2.05, P=0.018), PYGL (HR=1.37, 95% CI: 1.04–1.81, P=0.024), and CR1 (HR=0.55, 95% CI: 0.34–0.87, P=0.011) were significantly associated with the prognosis of ICH patients (Figure 4F).

Figure 4
www.frontiersin.org

Figure 4. Differentially expressed immune-related genes were screened via weighted gene coexpression network analysis (WGCNA) and GSEA. (A) Cluster dendrogram showing the gene modules among the four groups. Genes were divided into various modules by hierarchical clustering, and different colors represent different modules. (B) Heatmap of module-trait relationships. The black and dark gray modules were significantly associated with intracerebral hemorrhage (ICH) (R>0.5, p < 0.001). (C) Venn diagram showing eighteen differentially expressed immune-related genes (IRGs) identified via WGCNA and immune system genes (GSEA). (D, E) Heatmap (D) and correlation network (E) showing the relationships between eighteen IRGs and immune cell infiltration. (F) Univariate Cox regression analysis of eighteen IRGs for prognostic assessment. *p<0.05, **p<0.01.

Identification of immune-related diagnostic biomarkers for ICH with multiple classification algorithms

To further evaluate the eighteen immune-related DEGs as biomarkers for the diagnosis of ICH, we used the LASSO, SVM-RFE, XGBoost-RET and Boruta algorithms to rank the importance of the features according to their expression levels in all the samples. We reduced the number of dimensions through LASSO regression and selected 4 genes to construct a diagnostic model for ICH (Figures 5A, B). Similarly, we identified 5 genes from SVM-RFE and 10 genes from XGBoost-RET with the highest scores as optimal diagnostic tools for ICH (Figures 5C, D). Furthermore, we carried out feature selection via the Boruta algorithm and identified 11 genes as important, of which TLR8, CKAP4 and BCL6 were consistent with the top discriminatory biomarkers identified by the above three models (Figure 5E). Ultimately, 3 candidate genes (CKAP4, BCL6 and TLR8) were identified by intersection, and their diagnostic value was further assessed (Figure 5F).

Figure 5
www.frontiersin.org

Figure 5. Detection of immune-related diagnostic markers via four classification algorithms. (A) LASSO regression algorithm for screening diagnostic markers. The relationship between the binomial deviation and log(λ) was determined via 10-fold cross-validation (CV). (B) The coefficients of 18 feature genes are shown as log(λ). Different colors represent different genes. (C) SVM-RFE was used to screen biomarkers. (D) The XGBoost-RET algorithm was used to screen biomarkers. (E) The Boruta algorithm was used to screen biomarkers. (F) Venn diagram showing the intersection of the diagnostic markers obtained with the four algorithms. SVM-RFE: support vector machine recursive feature elimination, LASSO: least absolute shrinkage and selection operator, XGBoost: extreme gradient boosting.

Validation of candidate biomarkers via quantitative real-time polymerase chain reaction

To explore the ability of the three candidate genes as potential biomarkers to distinguish between ICH patients and other groups, we detected the expression levels of CKAP4, BCL6 and TLR8 in ICH patients, IS patients, HTN controls and CTRLs as FPKM values in both cohorts. The results demonstrated that CKAP4, BCL6 and TLR8 levels were significantly upregulated in ICH patients compared with those in IS patients, HTN controls and CTRLs but were not significantly different between IS patients and HTN controls (or CTRLs) in both the discovery (Figures 6A-C) and validation cohorts (Figures 6D-F). These three candidate biomarker genes were subsequently validated via RT−PCR in technical replicates of the four groups, and the results were consistent with those obtained via RNA sequencing (Figures 6G−I). These results indicate that CKAP4, BCL6 and TLR8 are specifically upregulated in patients with ICH and could be used as diagnostic biomarkers for ICH.

Figure 6
www.frontiersin.org

Figure 6. Validation of candidate gene expression levels via quantitative real-time polymerase chain reaction (RT−PCR). (A-F) The expression levels of three candidate genes in the discovery (A-C) and validation (D-F) cohorts among the four groups according to FPKM values. CKAP4 (A, D), BCL6 (B, E) and TLR8 (C, F). (G-I) RT−PCR results validated the expression levels of CKAP4 (G), BCL6 (H) and TLR8 (I) among 64 intracerebral hemorrhage (ICH) patients, 59 ischemic stroke (IS) patients, 60 hypertension (HTN) controls and 50 healthy controls (CTRLs). The data are presented as the median (interquartile range). *p<0.05, ***p<0.001, **p<0.01, ****p<0.0001. ns, no significance. Statistical significance was assessed using one-way ANOVA.

Diagnostic value of the three candidate biomarkers for ICH patients

Given that the above three genes were significantly differentially expressed according to both RNA sequencing and RT−PCR, we determined their diagnostic value with eight machine learning classification algorithms. The average performance values of the three candidate biomarkers for ICH based on accuracy and the area under the curve (AUC) in the training set, validation set and test set. Among these classifiers, the performance of GNB was superior to that of the other seven algorithms; the AUC for the model was 0.93 in the training set, 0.91 in the validation set, and 0.97 in the test set (Table 3). These results indicate that these three candidate biomarkers have great clinical value in the diagnosis of ICH.

Table 3
www.frontiersin.org

Table 3. Classification performance for the three-candidate RNA signatures in ICH patients.

Receiver operating characteristic (ROC) curve analysis was subsequently performed to explore the potential diagnostic value of the three candidate biomarkers for ICH. The AUCs of the CKAP4, BCL6 and TLR8 signatures for differentiating patients with ICH from HTN controls were 0.90, 0.82, and 0.92, respectively (Figure 7A). The AUC values of the CKAP4, BCL6 and TLR8 signatures for differentiating patients with ICH and CTRLs were 0.89, 0.84 and 0.95, respectively (Figure 7B); the AUC values of the CKAP4, BCL6 and TLR8 signatures for differentiating patients with ICH and IS patients were 0.80, 0.74 and 0.78, respectively (Figure 7C). The combination of CKAP4, BCL6 and TLR8 for differentiating patients with ICH from HTN controls, CTRLs and IS patients had AUC values of 0.93 (95% CI: 0.89–0.98), 0.95 (95% CI: 0.92–0.99) and 0.82 (95% CI: 0.74–0.89), respectively, with sensitivities of 81.3%, 84.4% and 75%, respectively, and specificities of 100%, 96% and 79.7%, respectively (Figures 7A−C). We used DCA to evaluate the clinical utility of the three candidates by qualifying the net benefit at a distinct threshold. The curve shows that the number of positive cases predicted by the model was close to the actual number of positive cases. As the risk threshold increased, the number of cases predicted by the model approached the actual number of cases. As expected, the DCA results revealed that CKAP4, BCL6 and TLR8 had similar clinical values in the diagnosis of ICH when these patients were differentiated from CTRLs and IS patients. Compared with CKAP4 and BCL6, TLR8 had greater clinical value (Figures 7D-F). On the basis of the DCA results, we further plotted clinical impact curves to evaluate the clinical utility of the diagnostic model. The clinical impact curves of the combination of the three biomarkers showed that the predicted probability coincided well with the actual probability of differentiating patients with ICH from HTN controls, CTRLs and IS patients (Figures 7G-I), suggesting that the diagnostic model had excellent predictive value. These results indicate that CKAP4, BCL6 and TLR8 are diagnostic biomarkers for ICH either individually or in combination.

Figure 7
www.frontiersin.org

Figure 7. Diagnostic value of the candidate genes in intracerebral hemorrhage (ICH) patients. (A) Receiver operating characteristic (ROC) curves were generated using the expression levels of CKAP4, BCL6 and TLR8 individually or in combination to differentiate patients with ICH from hypertension controls (n = 64 vs. 60). (B) ROC curves were generated using the expression levels of CKAP4, BCL6 and TLR8 to differentiate patients with ICH from healthy controls in all samples (n = 64 vs. 50). (C) ROC curves of CKAP4, BCL6 and TLR8 were generated to differentiate ICH patients from IS patients in all samples (n = 64 vs. 59). (D-F) DCA curves of CKAP4, BCL6 and TLR8 were generated to differentiate ICH patients from HTN controls (D), CTRLs (E) and IS patients (F) in all samples. (G-I) Clinical impact curves of the combination of CKAP4, BCL6 and TLR8 for discriminating ICH patients from HTN controls (G), CTRLs (H) and IS patients (I).

Correlation analysis of three candidate biomarkers and clinical characteristics

To further assess the associations of the three candidate genes with the clinical characteristics of ICH patients, we performed Spearman’s correlation analysis to evaluate the correlations between the expression levels of CKAP4, BCL6 and TLR8 and the clinical characteristics of ICH patients. CKAP4, BCL6 and TLR8 expression levels were positively correlated with white blood cell counts and glucose levels and negatively correlated with low-density lipoprotein cholesterol (LDL-C), triacylglycerol (TG), total cholesterol (TC), uric acid (UA) and apolipoprotein A (ApoA) levels in ICH patients (P < 0.05). TLR8 and CKAP4 were positively correlated with direct bilirubin (DBIL) levels and negatively correlated with sex and HDL-C and apolipoprotein B (ApoB) levels (P < 0.05). BCL6 expression levels were positively correlated with SBP and DBP and negatively correlated with red blood cell (RBC) counts and hemoglobin (Hb) levels (P < 0.05) (Supplementary Figure 6; Table 4). These results indicate that CKAP4, BCL6 and TLR8 may be involved in the pathogenesis of ICH.

Table 4
www.frontiersin.org

Table 4. Correlation between three RNA levels and baseline characteristic in ICH patients.

External and experimental validation of the expression levels of three candidate immune-related biomarkers of ICH

To further verify the clinical utility of these three candidate biomarkers for the diagnosis of patients with ICH, we recruited 20 patients with ICH and 20 CTRLs as another independent cohort for validation. CKAP4, BCL6 and TLR8 mRNA (Figures 8A-C) and protein (Figures 8D, E) levels were significantly higher in patients with ICH than in those with CTRLs (Supplementary Table 8). We subsequently detected the expression levels of CKAP4, BCL6 and TLR8 in ICH model mice and found that CKAP4, BCL6 and TLR8 were elevated in brain tissues after hemorrhage (Figures 8F-H), which was consistent with the RNA sequencing and RT−PCR results. Additionally, we further investigated the functional roles of these three genes in macrophages. RAW264.7 cells were polarized to the M1 or M2 macrophage phenotype via treatment with LPS plus IFN-γ or IL-4, respectively. M1 macrophages expressed the M1 markers TNFα, iNOS and CXCL10 (Figure 8I); in contrast, M2 macrophages expressed the M2 markers arginase 1 (Arg1) and CD206 (Figure 8J), suggesting that RAW264.7 cells were successfully polarized to M1 and M2 macrophages. We found that CKAP4 and TLR8 levels were decreased in M1 macrophages and increased in M2 macrophages, whereas BCL6 expression was increased in M1 macrophages but not in M2 macrophages at both the mRNA (Figures 8K−M) and protein levels (Figures 8N−Q). These results indicate that BCL6 may promote M1 macrophage polarization to activate neuroinflammation, whereas CKAP4 and TLR8 may promote M2 macrophage polarization to alleviate neuroinflammation after ICH, which may provide potential therapeutic targets for ICH.

Figure 8
www.frontiersin.org

Figure 8. External and experimental validation. Comparison of CKAP4 (A), BCL6 (B) and TLR8 (C) expression between ICH patients and CTRLs in an independent cohort (n=20 per group). (D, E) Western blot analysis of the expression levels of CKAP4, BCL6 and TLR8 in ICH patients and CTRLs in peripheral blood mononuclear cells (n=5 per group). (F) Representative images of HE staining in the ICH mice brain tissue and control groups. Scale bars, 100 µm. (G, H) Representative images of brain sections showing CKAP4, BCL6 and TLR8 staining in ICH mice and the control group; DAPI is shown in blue; BCL6 is shown in red; CKAP4 and TLR8 are shown in green. Scale bars, 100 µm. (I, J) The expression levels of M1 (TNF-α, iNOS, and CXCL10) and M2 (Arginase 1 and CD206) marker genes in RAW264.7 cells after stimulation with LPS (100 ng/ml), INF-γ (20 ng/ml) or IL4 (20 ng/ml) for 24 hours (n=3 per group). (K−M) The relative mRNA expression levels of CKAP4 (K), BCL6 (L) and TLR8 (M) in M1 and M2 macrophages (n=3-6 per group). (N-O) Western blot analysis of the expression levels of CKAP4, BCL6 and TLR8 in M1 and M2 macrophages. (P, Q) Relative CKAP4, BCL6 and TLR8 protein expression levels in M1 and M2 macrophages (n=3 per group). The data are presented as the means ± SDs. *p<0.05; **p<0.01, ***p<0.001. ns, no significance. Statistical significance was assessed using 2-tailed Student’s t test (A−E, H-J, P−Q) or one-way ANOVA (K−M).

Discussion

Given the high morbidity and mortality of stroke, no biomarkers for stroke are available for use in clinical practice, and the identification of potential biomarkers for discriminating between ICH and IS is essential. In this study, we investigated lncRNA and mRNA expression profiles in peripheral blood from ICH patients, IS patients, HTN controls and CTRLs via RNA sequencing. Functional analysis revealed that the most significantly enriched pathway of DEGs after ICH was involved in the immune system response. We further explored the immune cell composition and found that the abundances of 11 types of immune cells, including T-cell subtypes (Th1, Tfh, Th17, Tr1, iTreg, CD4+ T, CD8+ T and gamma delta T cells), B cells, neutrophils, monocytes and NK cells, were significantly altered after ICH, suggesting that the inflammatory response was involved in neuronal injury after ICH. In addition, by using multiple machine learning algorithms, we established an immune-related biomarker panel (CKAP4, BCL6, and TLR8) whose components are upregulated after ICH. This panel had AUC values of 0.93, 0.95 and 0.82, with sensitivities of 81.3%, 84.4% and 75%, respectively, and specificities of 100%, 96% and 79.7%, respectively, for discriminating ICH patients from HTN controls, CTRLs and IS patients; thus, this panel has potential diagnostic value for ICH.

Functional enrichment revealed that the immune system was the most significantly affected pathway in ICH patients, indicating that the immune system could be a potential therapeutic target related to the pathological effects of ICH. Neutrophils are the first leukocyte population to migrate into the brain after ICH, and their effects appear to be mainly deleterious in the context of brain injury (29). Evidence indicates that anti-polymorphonuclear neutrophil therapy administered via intravenous injection can reduce blood–brain barrier disruption and prevent neurological injury (30, 31). Monocytes also invade within 12 h after ICH, which indicates that they may be involved in secondary injury (29, 32). In our study, neutrophil and monocyte abundances increased during ICH, which is consistent with the findings of previous studies. Suppressed neutrophil and monocyte responses may alleviate neuroinflammation and brain injury, which could be therapeutic targets for ICH.

We also explored the composition of immune cells after ICH and reported that the proportions of T-cell subtypes among blood cells decreased after ICH. Previous studies using ImmuneCellAI analysis also suggested that CD4+ T-cell numbers are significantly decreased in ICH patients (33). T-cell subtypes with decreased abundance might be recruited to the brain and involved in inflammatory and anti-inflammatory responses. Reports have suggested that CD4+ T-cell numbers increase within 24 h after ICH and that CD8+ T-cell numbers increase approximately 3 to 4 days after ICH (34, 35); another study revealed that CD4+ T-cell numbers significantly decreased in ICH patients compared with controls, but there was no significant difference in CD8+ T-cell numbers between ICH patients and controls (33). There are two subtypes of CD4+ T cells, designated Th1 and Th2 cells, on the basis of cytokine secretion patterns (36). Th1 cell differentiation is induced by interleukin-2 and interferon-γ in response to proinflammatory signals (37, 38). Th2 cell differentiation is initiated by IL-4, IL-5 and IL-13, which stimulate B cells to produce abundant antibodies, which are involved in anti-inflammatory functions (39). Studies have demonstrated that Treg cells are beneficial after ICH and IS. Treg cell deficiency increases brain damage and neurological deterioration, and therapeutically increasing Treg cell numbers ameliorates ICH-induced inflammatory injury (40, 41). Transplanting neural stem cells reportedly increases Treg cell numbers and decreases gamma delta T-cell numbers to protect against brain injury in an ICH rat model (42).

Emerging evidence has revealed that machine learning algorithms have been developed to construct diagnostic models, which have become increasingly promising tools for analyzing large amounts of data, such as transcriptome sequences (43, 44). Moreover, specific variation in expression profiles was analyzed by combining GSEA and WGCNA (45) (46). In this study, we identified 18 potential immune-related biomarkers of ICH via GSEA and WGCNA of four groups. After four algorithms were used to select features, 3 overlapping candidate genes (CKAP4, BCL6 and TLR8) were subsequently validated via RT−PCR, and eight machine learning classification algorithms were used to determine their diagnostic value. The AUCs of this panel were 0.93, 0.95 and 0.82 for discriminating ICH patients from HTN controls, CTRLs and IS patients, respectively. CKAP4 is secreted into the serum by tumors and is expected to be a novel serological marker for the diagnosis of various cancers (47). The activation of PI3K–AKT signaling downstream of CKAP4 contributes to immune suppression in macrophages (48). BCL6 modulates the immune response and inflammation by regulating macrophage polarization and plays a critical role in autoimmune encephalomyelitis (49). Toll-like receptor 8 (TLR8) is expressed in different immune cell subtypes and can recognize single-stranded RNA and initiate early inflammatory responses (50). However, the roles of these three genes in ICH patients have not been reported. Our study is the first to show that CKAP4, BCL6 and TLR8 are immune-related biomarkers that differ between ICH patients and IS patients or controls and have promising diagnostic value in patients with ICH. We further found that CKAP4 and TLR8 were downregulated in M1 macrophages and upregulated in M2 macrophages, whereas BCL6 expression was upregulated in M1 macrophages but not in M2 macrophages, indicating that BCL6 may promote M1 macrophage polarization to aggravate neuroinflammation, whereas CKAP4 and TLR8 may promote M2 macrophage polarization to alleviate neuroinflammation after ICH, which may be potential therapeutic targets for ICH.

There are some limitations of our study. First, we focused on one timepoint after ICH, and multiple time points for multilayer distribution analysis should be used to control for confounding factors. A larger multicenter study with more individuals should be conducted for external validation. Second, to ensure the accuracy of the identified RNA panel in guiding clinical treatment, single cells need to be isolated and analyzed via single-cell RNA sequencing, and the functions and mechanism of the anti-inflammatory effects of CKAP4 and TLR8, as well as the proinflammatory effects of BCL6 in ICH patients, need to be further explored in future studies. Third, the eight machine learning classifiers were applied to small training datasets because of the limited sample size; however, the best performing classifier was comprehensively validated and then confirmed with a series of indices. Therefore, the selected features are considered significant.

Conclusion

Our study comprehensively compares the transcriptome profiles of ICH patients, IS patients, HTN controls and CTRLs to aid in the early differentiation of patients with ICH from those with IS. Eleven immune cell types with significantly altered abundance after ICH were identified; these findings might provide useful insight into the pathogenesis and therapeutic approaches for patients with ICH. Furthermore, an RNA panel (CKAP4, BCL6, and TLR8) was developed as a potential biomarker for ICH detection. This study provides a new perspective on the pathogenesis of ICH and a more effective diagnostic tool.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: HRA001807 (GSA; https://ngdc.cncb.ac.cn/gsa-human/browse/HRA001807).

Ethics statement

The studies involving humans were approved by Human Ethics Committee of Fuwai Hospital(Approval No. 2016-732). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Ethical approval was not required for the studies on animals in accordance with the local legislation and institutional requirements because only commercially available established cell lines were used.

Author contributions

CB: Writing – original draft, Writing – review & editing, Funding acquisition. XL: Validation, Writing – review & editing. FW: Visualization, Writing – review & editing. YS: Investigation, Writing – review & editing. JW: Data curation, Writing – review & editing. JL: Investigation, Writing – review & editing. XH: Software, Writing – review & editing. LZ: Resources, Writing – review & editing. YY: Supervision, Writing – review & editing. JL: Funding acquisition, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was supported by the National Natural Science Foundation of China (82300488 to CB and 81972026 to JL).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2024.1421942/full#supplementary-material

References

1. Collaborators GBDS. Global, regional, and national burden of stroke and its risk factors, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. (2021) 20:795–820. doi: 10.1016/S1474-4422(21)00252-0

PubMed Abstract | Crossref Full Text | Google Scholar

2. Saini V, Guada L, Yavagal DR. Global epidemiology of stroke and access to acute ischemic stroke interventions. Neurology. (2021) 97:S6–S16. doi: 10.1212/WNL.0000000000012781

PubMed Abstract | Crossref Full Text | Google Scholar

3. Collaborators GBDS. Global, regional, and national burden of stroke, 1990-2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol. (2019) 18:439–58. doi: 10.1016/S1474-4422(18)30499-X

PubMed Abstract | Crossref Full Text | Google Scholar

4. Qureshi AI, Mendelow AD, Hanley DF. Intracerebral haemorrhage. Lancet. (2009) 373:1632–44. doi: 10.1016/S0140-6736(09)60371-8

PubMed Abstract | Crossref Full Text | Google Scholar

5. Wang W, Jiang B, Sun H, Ru X, Sun D, Wang L, et al. Prevalence, incidence, and mortality of stroke in China: results from a nationwide population-Based survey of 480 687 adults. Circulation. (2017) 135:759–71. doi: 10.1161/CIRCULATIONAHA.116.025250

PubMed Abstract | Crossref Full Text | Google Scholar

6. Feigin VL, Roth GA, Naghavi M, Parmar P, Krishnamurthi R, Chugh S, et al. Global burden of stroke and risk factors in 188 countries, during 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013. Lancet Neurol. (2016) 15:913–24. doi: 10.1016/S1474-4422(16)30073-4

PubMed Abstract | Crossref Full Text | Google Scholar

7. Xi G, Keep RF, Hoff JT. Mechanisms of brain injury after intracerebral haemorrhage. Lancet Neurol. (2006) 5:53–63. doi: 10.1016/S1474-4422(05)70283-0

PubMed Abstract | Crossref Full Text | Google Scholar

8. Ariesen MJ, Claus SP, Rinkel GJ, Algra A. Risk factors for intracerebral hemorrhage in the general population: a systematic review. Stroke. (2003) 34:2060–5. doi: 10.1161/01.STR.0000080678.09344.8D

PubMed Abstract | Crossref Full Text | Google Scholar

9. Sidorov E, Sanghera DK, Vanamala JKP. Biomarker for ischemic stroke using metabolome: A clinician perspective. J Stroke. (2019) 21:31–41. doi: 10.5853/jos.2018.03454

PubMed Abstract | Crossref Full Text | Google Scholar

10. Montaner J, Ramiro L, Simats A, Tiedt S, Makris K, Jickling GC, et al. Multilevel omics for the discovery of biomarkers and therapeutic targets for stroke. Nat Rev Neurol. (2020) 16:247–64. doi: 10.1038/s41582-020-0350-6

PubMed Abstract | Crossref Full Text | Google Scholar

11. Kamtchum-Tatuene J, Jickling GC. Blood biomarkers for stroke diagnosis and management. Neuromolecular Med. (2019) 21:344–68. doi: 10.1007/s12017-019-08530-0

PubMed Abstract | Crossref Full Text | Google Scholar

12. Njoku K, Pierce A, Chiasserini D, Geary B, Campbell AE, Kelsall J, et al. Detection of endometrial cancer in cervico-vaginal fluid and blood plasma: leveraging proteomics and machine learning for biomarker discovery. EBioMedicine. (2024) 102:105064. doi: 10.1016/j.ebiom.2024.105064

PubMed Abstract | Crossref Full Text | Google Scholar

13. Cheng X, Ander BP, Jickling GC, Zhan X, Hull H, Sharp FR, et al. MicroRNA and their target mRNAs change expression in whole blood of patients after intracerebral hemorrhage. J Cereb Blood Flow Metab. (2020) 40:775–86. doi: 10.1177/0271678X19839501

PubMed Abstract | Crossref Full Text | Google Scholar

14. Carmona-Mora P, Ander BP, Jickling GC, Dykstra-Aiello C, Zhan X, Ferino E, et al. Distinct peripheral blood monocyte and neutrophil transcriptional programs following intracerebral hemorrhage and different etiologies of ischemic stroke. J Cereb Blood Flow Metab. (2021) 41:1398–416. doi: 10.1177/0271678X20953912

PubMed Abstract | Crossref Full Text | Google Scholar

15. Yu Z, Hu E, Cai Y, Zhu W, Chen Q, Li T, et al. mRNA and lncRNA co-expression network in mice of acute intracerebral hemorrhage. Front Mol Neurosci. (2023) 16:1166875. doi: 10.3389/fnmol.2023.1166875

PubMed Abstract | Crossref Full Text | Google Scholar

16. Li L, Wang P, Zhao H, Luo Y. Noncoding RNAs and intracerebral hemorrhage. CNS Neurol Disord Drug Targets. (2019) 18:205–11. doi: 10.2174/1871527318666190204102604

PubMed Abstract | Crossref Full Text | Google Scholar

17. Zuo L, Zhang L, Zu J, Wang Z, Han B, Chen B, et al. Circulating circular RNAs as biomarkers for the diagnosis and prediction of outcomes in acute ischemic stroke. Stroke. (2020) 51:319–23. doi: 10.1161/STROKEAHA.119.027348

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zhang X, Hamblin MH, Yin KJ. Noncoding RNAs and stroke. Neuroscientist. (2019) 25:22–6. doi: 10.1177/1073858418769556

PubMed Abstract | Crossref Full Text | Google Scholar

19. Bai C, Hao X, Zhou L, Sun Y, Song L, Wang F, et al. Machine learning-based identification of the novel circRNAs circERBB2 and circCHST12 as potential biomarkers of intracerebral hemorrhage. Front Neurosci. (2022) 16:1002590. doi: 10.3389/fnins.2022.1002590

PubMed Abstract | Crossref Full Text | Google Scholar

20. Bai C, Liu T, Sun Y, Li H, Xiao N, Zhang M, et al. Identification of circular RNA expression profiles and potential biomarkers for intracerebral hemorrhage. Epigenomics. (2021) 13:379–95. doi: 10.2217/epi-2020-0432

PubMed Abstract | Crossref Full Text | Google Scholar

21. Hankey GJ. Stroke. Lancet. (2017) 389:641–54. doi: 10.1016/S0140-6736(16)30962-X

PubMed Abstract | Crossref Full Text | Google Scholar

22. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. (2014) 15:550. doi: 10.1186/s13059-014-0550-8

PubMed Abstract | Crossref Full Text | Google Scholar

23. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. (2010) 26:139–40. doi: 10.1093/bioinformatics/btp616

PubMed Abstract | Crossref Full Text | Google Scholar

24. Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. (2012) 16:284–7. doi: 10.1089/omi.2011.0118

PubMed Abstract | Crossref Full Text | Google Scholar

25. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinf. (2008) 9:559. doi: 10.1186/1471-2105-9-559

Crossref Full Text | Google Scholar

26. Miao YR, Zhang Q, Lei Q, Luo M, Xie GY, Wang H, et al. ImmuCellAI: A unique method for comprehensive T-cell subsets abundance prediction and its application in cancer immunotherapy. Adv Sci (Weinh). (2020) 7:1902880. doi: 10.1002/advs.201902880

PubMed Abstract | Crossref Full Text | Google Scholar

27. Xiao N, Liu TL, Li H, Xu HC, Ge J, Wen HY, et al. Low serum uric acid levels promote hypertensive intracerebral hemorrhage by disrupting the smooth muscle cell-Elastin contractile unit and upregulating the erk1/2-MMP axis. Transl Stroke Res. (2020) 11:1077–94. doi: 10.1007/s12975-020-00791-3

PubMed Abstract | Crossref Full Text | Google Scholar

28. Li H, Xu H, Wen H, Liu T, Sun Y, Xiao N, et al. Overexpression of LH3 reduces the incidence of hypertensive intracerebral hemorrhage in mice. J Cereb Blood Flow Metab. (2019) 39:547–61. doi: 10.1177/0271678X18815791

PubMed Abstract | Crossref Full Text | Google Scholar

29. Mracsko E, Veltkamp R. Neuroinflammation after intracerebral hemorrhage. Front Cell Neurosci. (2014) 8:388. doi: 10.3389/fncel.2014.00388

PubMed Abstract | Crossref Full Text | Google Scholar

30. Moxon-Emre I, Schlichter LC. Neutrophil depletion reduces blood-brain barrier breakdown, axon injury, and inflammation after intracerebral hemorrhage. J Neuropathol Exp Neurol. (2011) 70:218–35. doi: 10.1097/NEN.0b013e31820d94a5

PubMed Abstract | Crossref Full Text | Google Scholar

31. Gautier S, Ouk T, Petrault O, Caron J, Bordet R. Neutrophils contribute to intracerebral haemorrhages after treatment with recombinant tissue plasminogen activator following cerebral ischaemia. Br J Pharmacol. (2009) 156:673–9. doi: 10.1111/j.1476-5381.2009.00068.x

PubMed Abstract | Crossref Full Text | Google Scholar

32. Sansing LH, Harris TH, Welsh FA, Kasner SE, Hunter CA, Kariko K. Toll-like receptor 4 contributes to poor outcome after intracerebral hemorrhage. Ann Neurol. (2011) 70:646–56. doi: 10.1002/ana.22528

PubMed Abstract | Crossref Full Text | Google Scholar

33. Stamova B, Ander BP, Jickling G, Hamade F, Durocher M, Zhan X, et al. The intracerebral hemorrhage blood transcriptome in humans differs from the ischemic stroke and vascular risk factor control blood transcriptomes. J Cereb Blood Flow Metab. (2019) 39:1818–35. doi: 10.1177/0271678X18769513

PubMed Abstract | Crossref Full Text | Google Scholar

34. Xue M, Del Bigio MR. Comparison of brain cell death and inflammatory reaction in three models of intracerebral hemorrhage in adult rats. J Stroke Cerebrovasc Dis. (2003) 12:152–9. doi: 10.1016/S1052-3057(03)00036-3

PubMed Abstract | Crossref Full Text | Google Scholar

35. Loftspring MC, McDole J, Lu A, Clark JF, Johnson AJ. Intracerebral hemorrhage leads to infiltration of several leukocyte populations with concomitant pathophysiological changes. J Cereb Blood Flow Metab. (2009) 29:137–43. doi: 10.1038/jcbfm.2008.114

PubMed Abstract | Crossref Full Text | Google Scholar

36. Mosmann TR, Cherwinski H, Bond MW, Giedlin MA, Coffman RL. Two types of murine helper T cell clone. I. Definition according to profiles of lymphokine activities and secreted proteins. J Immunol. (1986) 136:2348–57. doi: 10.4049/jimmunol.136.7.2348

PubMed Abstract | Crossref Full Text | Google Scholar

37. Iadecola C, Anrather J. The immunology of stroke: from mechanisms to translation. Nat Med. (2011) 17:796–808. doi: 10.1038/nm.2399

PubMed Abstract | Crossref Full Text | Google Scholar

38. Hendrix S, Nitsch R. The role of T helper cells in neuroprotection and regeneration. J Neuroimmunol. (2007) 184:100–12. doi: 10.1016/j.jneuroim.2006.11.019

PubMed Abstract | Crossref Full Text | Google Scholar

39. Walker JA, McKenzie ANJ. TH2 cell development and function. Nat Rev Immunol. (2018) 18:121–33. doi: 10.1038/nri.2017.118

PubMed Abstract | Crossref Full Text | Google Scholar

40. Liesz A, Suri-Payer E, Veltkamp C, Doerr H, Sommer C, Rivest S, et al. Regulatory T cells are key cerebroprotective immunomodulators in acute experimental stroke. Nat Med. (2009) 15:192–9. doi: 10.1038/nm.1927

PubMed Abstract | Crossref Full Text | Google Scholar

41. Zhou K, Zhong Q, Wang YC, Xiong XY, Meng ZY, Zhao T, et al. Regulatory T cells ameliorate intracerebral hemorrhage-induced inflammatory injury by modulating microglia/macrophage polarization through the IL-10/GSK3beta/PTEN axis. J Cereb Blood Flow Metab. (2017) 37:967–79. doi: 10.1177/0271678X16648712

PubMed Abstract | Crossref Full Text | Google Scholar

42. Gao L, Lu Q, Huang LJ, Ruan LH, Yang JJ, Huang WL, et al. Transplanted neural stem cells modulate regulatory T, gammadelta T cells and corresponding cytokines after intracerebral hemorrhage in rats. Int J Mol Sci. (2014) 15:4431–41. doi: 10.3390/ijms15034431

PubMed Abstract | Crossref Full Text | Google Scholar

43. Yang Y, Cao Y, Han X, Ma X, Li R, Wang R, et al. Revealing EXPH5 as a potential diagnostic gene biomarker of the late stage of COPD based on machine learning analysis. Comput Biol Med. (2023) 154:106621. doi: 10.1016/j.compbiomed.2023.106621

PubMed Abstract | Crossref Full Text | Google Scholar

44. Komorowski M, Green A, Tatham KC, Seymour C, Antcliffe D. Sepsis biomarkers and diagnostic tools with a focus on machine learning. EBioMedicine. (2022) 86:104394. doi: 10.1016/j.ebiom.2022.104394

PubMed Abstract | Crossref Full Text | Google Scholar

45. Mei S, Shao Y, Fang Y, Lu J, Zheng J, Xu S, et al. The changes of leukocytes in brain and blood after intracerebral hemorrhage. Front Immunol. (2021) 12:617163. doi: 10.3389/fimmu.2021.617163

PubMed Abstract | Crossref Full Text | Google Scholar

46. Zhao X, Zhang L, Wang J, Zhang M, Song Z, Ni B, et al. Identification of key biomarkers and immune infiltration in systemic lupus erythematosus by integrated bioinformatics analysis. J Transl Med. (2021) 19:35. doi: 10.1186/s12967-020-02698-x

PubMed Abstract | Crossref Full Text | Google Scholar

47. Li SX, Li J, Dong LW, Guo ZY. Cytoskeleton-associated protein 4, a promising biomarker for tumor diagnosis and therapy. Front Mol Biosci. (2020) 7:552056. doi: 10.3389/fmolb.2020.552056

PubMed Abstract | Crossref Full Text | Google Scholar

48. Shi T, Zhang Y, Wang Y, Song X, Wang H, Zhou X, et al. DKK1 promotes tumor immune evasion and impedes anti-PD-1 treatment by inducing immunosuppressive macrophages in gastric cancer. Cancer Immunol Res. (2022) 10:1506–24. doi: 10.1158/2326-6066.CIR-22-0218

PubMed Abstract | Crossref Full Text | Google Scholar

49. Li Q, Zhou L, Wang L, Li S, Xu G, Gu H, et al. Bcl6 modulates innate immunity by controlling macrophage activity and plays critical role in experimental autoimmune encephalomyelitis. Eur J Immunol. (2020) 50:525–36. doi: 10.1002/eji.201948299

PubMed Abstract | Crossref Full Text | Google Scholar

50. Yang C, Bachu M, Du Y, Brauner C, Yuan R, Ah Kioon MD, et al. CXCL4 synergizes with TLR8 for TBK1-IRF5 activation, epigenomic remodeling and inflammatory response in human monocytes. Nat Commun. (2022) 13:3426. doi: 10.1038/s41467-022-31132-7

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: intracerebral hemorrhage, RNA sequencing, immune cells, biomarkers, machine learning

Citation: Bai C, Liu X, Wang F, Sun Y, Wang J, Liu J, Hao X, Zhou L, Yuan Y and Liu J (2024) Identification of immune-related biomarkers for intracerebral hemorrhage diagnosis based on RNA sequencing and machine learning. Front. Immunol. 15:1421942. doi: 10.3389/fimmu.2024.1421942

Received: 23 April 2024; Accepted: 16 August 2024;
Published: 30 August 2024.

Edited by:

Baohui Xu, Stanford University, United States

Reviewed by:

Ivana Kawikova, University of Hartford, United States
Juan Chen, Huazhong University of Science and Technology, China
Rong Jin, Penn State Milton S. Hershey Medical Center, United States

Copyright © 2024 Bai, Liu, Wang, Sun, Wang, Liu, Hao, Zhou, Yuan and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jiayun Liu, amlheXVuQGZtbXUuZWR1LmNu; Yu Yuan, YXV0b2ZhbjEwMEBhbGl5dW4uY29t; Lei Zhou, aWFtd29sZi1zbnpxQDE2My5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.