ORIGINAL RESEARCH article

Front. Bioeng. Biotechnol., 05 March 2020

Sec. Computational Genomics

Volume 8 - 2020 | https://doi.org/10.3389/fbioe.2020.00167

Identification and Analysis of Glioblastoma Biomarkers Based on Single Cell Sequencing

  • 1. Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China

  • 2. Department of Clinical Pharmacology, Xiangya Hospital, Central South University, Changsha, China

  • 3. Department of Rehabilitation, The Second Xiangya Hospital, Central South University, Changsha, China

  • 4. Department of Psychiatry, The Second People’s Hospital of Hunan University of Chinese Medicine, Changsha, China

Article metrics

View details

32

Citations

10,9k

Views

4k

Downloads

Abstract

Glioblastoma (GBM) is one of the most common and aggressive primary adult brain tumors. Tumor heterogeneity poses a great challenge to the treatment of GBM, which is determined by both heterogeneous GBM cells and a complex tumor microenvironment. Single-cell RNA sequencing (scRNA-seq) enables the transcriptomes of great deal of individual cells to be assayed in an unbiased manner and has been applied in head and neck cancer, breast cancer, blood disease, and so on. In this study, based on the scRNA-seq results of infiltrating neoplastic cells in GBM, computational methods were applied to screen core biomarkers that can distinguish the discrepancy between GBM tumor and pericarcinomatous environment. The gene expression profiles of GBM from 2343 tumor cells and 1246 periphery cells were analyzed by maximum relevance minimum redundancy (mRMR). Upon further analysis of the feature lists yielded by the mRMR method, 31 important genes were extracted that may be essential biomarkers for GBM tumor cells. Besides, an optimal classification model using a support vector machine (SVM) algorithm as the classifier was also built. Our results provided insights of GBM mechanisms and may be useful for GBM diagnosis and therapy.

Introduction

Glioblastoma (GBM), with an annual incidence of 3.19 per 100,000, maintains the most common and aggressive primary adult brain tumor (Stupp et al., 2007, 2017; Chinot et al., 2014; Gilbert et al., 2014; Ostrom et al., 2016). Currently, the standard therapeutic regimen has been established, including surgical resection, followed by radiotherapy with concurrent chemotherapy (temozolomide), then followed by maintenance therapy (temozolomide for 6–12 months) (Stupp et al., 2005). However, the diffuse nature of GBMs makes it invariably recur after treatment, rendering local therapies invalid, because the migrating GBM cells outside of the neoplasm core are usually unaffected by local therapies and hence cause recurrence of GBMs (Darmanis et al., 2017). The mean disease-free survival is just over 6 months and the mean overall survival also remains gloomy, with an approximately 25% 2-year survival rate after diagnosis and a 5–10% 5-year survival rate (Stupp et al., 2005, 2017; Das and Marsden, 2013).

Tumor heterogeneity poses a great challenge to the treatment of GBM, which is determined by both heterogeneous GBM cells and a complex tumor microenvironment. It is critical important for researchers to understand how different types of GBM cells interact with neoplasm cells through profiling of different types of cell from cell population in paraneoplastic environment, as well as identifying the lineage and phenotypes (Darmanis et al., 2017). Verhaak et al. (2010) has proved bulk tumor sequencing methods were useful in generating classification schemas of GBM subtypes, but the heterogeneity of GBM was not unveiled in essence (Cancer Genome Atlas Research Network, 2008). Until recently, RNA profiling was limited to ensemble-based approaches, averaging over bulk cell populations. Therefore, the advent of single-cell RNA sequencing (scRNA-seq) enables the transcriptomes of great deal of individual cells to be assayed in an unbiased manner (Stegle et al., 2015) and has been applied in head and neck cancer (Puram et al., 2017), breast cancer (Bajikar et al., 2017), blood disease (Zhao et al., 2017), and so on. Patel et al. (2014) profiled 430 cells from five GBM patients using scRNA-seq and described inter-patient variation and molecular diversity of tumor cells within individual GBM patients. The diversities of GBM cells within tumors are responsible for cancer progression and finally result in treatment failure.

Currently, in order to improve future treatment options, an increasing number of researchers have focused on the targeted agents or genes (Liu et al., 2013; Xiao et al., 2014; Li et al., 2018). Furnari et al. (2007) have identified genetic molecular mechanisms in GBM patients: (1) dysregulation of growth factor signaling through amplification and mutational activation of receptor tyrosine kinase (RTK) genes; (2) activation of the phosphatidyl inositol 3-kinase (PI3K) pathway; and (3) deactivation of the p53 and retinoblastoma tumor suppressor pathways. Moreover, four distinct GBM subclasses, including neural, proneural (PGFRA/IDH1 events), classical (focal EGFR events), and mesenchymal (NF1 mutation and loss), were defined by gene expression studies from The Cancer Genome Atlas (TCGA) (Verhaak et al., 2010), which also found the majority of GBM neoplasms had abnormalities in the pathways (RB, TP53, and RTK) through projecting copy number and mutation data on these pathways, revealing that this is a crucial step for GBM pathogenesis. Apart from such researches focused on tumor or microenvironment, many studies analyzed the gene expression of immune cells in GBM via scRNA-seq. Muller et al. (2017) identified 66 new gene sets which can be applied as biomarkers (such as P2RY12, CD49D, and HLA-DRA) to distinguish the different lineages of the macrophage cell subsets.

In this study, based on the scRNA-seq results of infiltrating neoplastic cells in GBM, computational methods were applied to screen core biomarkers that can distinguish the discrepancy between GBM tumor and pericarcinomatous environment. The gene expression profiles of GBM from 2343 tumor cells and 1246 periphery cells were analyzed by maximum relevance minimum redundancy (mRMR) (Peng et al., 2005). Upon further analysis of the feature lists yielded by the mRMR method, 31 important genes were extracted that may be essential biomarkers for GBM tumor cells. Besides, an optimal classification model using a support vector machine (SVM) algorithm (Ding and Dubchak, 2001) as the classifier was also built.

Materials and Methods

The Single Cell Gene Expression Profiles of Tumor and Surrounding Tissues

We download the single cell gene expression profiles of 2343 cells of tumor core and 1246 cells of surrounding tissue from Gene Expression Omnibus (GEO) with accession number of GSE84465 (Darmanis et al., 2017). 23,460 genes were measured using Illumina NextSeq 500. Within each sample, we counted the number of expressed genes, i.e., the number of genes with mapped reads. The average number of expressed genes in each sample was 2,581. Our goal is to discriminate the 2343 tumor cells (positive samples) and 1246 surrounding cells (negative samples).

The mRMR Ranking of Discriminative Genes

There have been many statistics methods for identifying the differentially expressed genes (DEGs). But these methods did not consider the relationships between genes. Usually, the number of DEGs was too large to apply as biomarker. Therefore, we adopted the information theory-based mRMR (minimal Redundancy Maximal Relevance) method (Peng et al., 2005) to overcome this problem. The mRMR method not only considers the associations between genes and samples, but also the redundancy between genes. If several genes are similar, only the most representative gene will be selected. This approach has been proven to be effective and has been widely used for many biomedical feature selection problems (Niu et al., 2013; Zhao et al., 2013; Zhou et al., 2015; Zhang et al., 2016; Liu et al., 2017), especially in single cell RNA-Seq analysis (Zhang et al., 2019). The sample size of single cell data was large and the gene expression was spare. It was easy to get too many redundant significant genes using traditional statistical based method, such as t-test. Therefore, the mRMR was suitable for analyzing single cell data to get small number of non-redundant biomarkers.

Let’s describe the method mathematically. All genes, selected genes, to be selected genes can be represented as Ω, Ωs, and Ωt, respectively. The relevance of gene g from Ωt with tissue type t can be measured with mutual information (I) (Sun et al., 2012; Huang and Cai, 2013):

And the redundancy R of the gene g with the selected genes in Ωs are

The goal of this algorithm is to get the gene gj from Ωt that has maximum relevance with tissue type t and minimum redundancy with the selected genes in Ωs, i.e., maximize the mRMR function

The evaluation procedure will be continued for N rounds, and all the genes will be ranked as a list

The index h reflects the trade-off between relevance with tissue type and redundancy with selected genes. The smaller index h is, the better discriminating power the gene has.

The Single Cell GBM Biomarker Optimization

Based on the top 100 mRMR genes, we constructed 100 SVM classifiers and applied an incremental feature selection (IFS) method (Jiang et al., 2013; Li et al., 2014; Shu et al., 2014; Zhang et al., 2014, 2015) to identify the optimal number of genes as biomarker. The svm function from R package e10171 was used to implement the SVM method. Each candidate gene set included the top k genes in the mRMR list.

We used leave-one-out cross validation (LOOCV) (Cui et al., 2013; Yang et al., 2014) to evaluate the prediction performance of each SVM classifier. During LOOCV, all of the N samples were tested one-by-one. In each round, one sample was used for testing of the prediction model trained with all the other N−1 samples. After N rounds, all samples were tested one time, and the predicted tissue types were compared with the actual tissue types.

Since the positive and negative sample sizes were imbalance and Mathew’s correlation coefficient (MCC) can consider both sensitivity and specificity (Huang et al., 2015), MCC was used in IFS optimization. MCC can be calculated as follows:

where TP, TN, FP, and FN stand for true positive, true negative, false positive, and false negative, respectively.

Based on the LOOCV MCC of each candidate gene set, an IFS curve can be plotted. The x-axis denoted the number of top genes that were used in the SVM classifier, and the y-axis denoted the LOOCV MCCs of the SVM classifiers. Based on the IFS curve, we can choose the right number of genes which had a good prediction performance as final biomarkers.

Results and Discussion

The Discriminative Importance of Genes

We applied mRMR algorithm to evaluate the discriminative importance of features iteratively. We want to find the features that were strongly associated with samples groups and were not redundant with other selected features. Using the mRMR method, we identified the top 100 most important genes. These genes were listed in Supplementary Table S1.

The Optimal GBM Biomarker Genes Selected With IFS Method

After we got the top 100 mRMR genes, we still did not know how many genes should be selected. To optimize the selected biomarker genes, we adopted IFS method. Each time, we added one feature into the previous feature set and got a new feature set. Then SVM classifiers were built to predict each sample’s labels during LOOCV. The IFS curve with the number of genes as x-axis and the prediction performance (LOOCV MCC) as y-axis were plotted in Figure 1. The peak MCC was 0.812 when 31 genes were used. These 31 genes were selected as optimal GBM biomarker genes. The 31 genes were listed in Table 1. The confusion matrix of the 31 genes were given in Table 2. The sensitivity, specificity, and accuracy were 0.948, 0.855, and 0.915, respectively.

FIGURE 1

FIGURE 1

The IFS curve of the top 100 mRMR genes. The x-axis was the number of genes and the y-axis was the prediction performance, i.e., LOOCV MCC. The peak MCC was 0.812 when 31 genes were used. These 31 genes were selected as optimal GBM biomarker genes.

TABLE 1

RankGeneRankGene
1TMSB4X17VIM
2IPCEF118ATP1A2
3MTSS119RPL41
4S100A1020EGR3
5HTRA121OMG
6DHRS922LDHA
7TPI123P2RY12
8SNX2224SPOCK1
9FCGBP25NAMPT
10TMSB1026C1QL2
11CCL327PTN
12SLC6A128CCL4
13SMOC129PDZD2
14SEC61G30LGALS1
15TGFBI31CLDN10
16CDR1

The 31 selected GBM biomarker genes.

TABLE 2

Predicted GBMPredicted non-GBM
Actual GBM2220123
Actual non-GBM1811065

The confusion matrix of the 31 selected genes.

Since the tumor tissues are usually a mixture of tumor cells and normal cells, the tumor purity may cause the misclassifications. To check this, Figures 2A,B showed the t-distributed stochastic neighbor embedding (t-SNE) plots of predicted GBM cells and predicted non-GBM cells, respectively. In Figure 2A, it can be seen that the false positive samples (red dots) and the true positive samples (black dots) were mixed and they were difficult to classify. Similarly, in Figure 2B, it can be seen that the false negative samples (black dots) and the true negative samples (red dots) were mixed. These t-SNE plots suggested that the GBM tissues may contain non-GBM cells and the non-GBM tissues may contain GBM cells, but most cells from the corresponding tissue were similar and the machine learning algorithm we used can get the robust single cell biomarkers even when there were tissue purity issues.

FIGURE 2

FIGURE 2

The t-SNE plots of predicted GBM cells and predicted non-GBM cells. (A) The t-SNE plots of predicted GBM cells. It can be seen that the false positive samples (red dots) and the true positive samples (black dots) were mixed and they were difficult to classify. (B) The t-SNE plots of predicted non-GBM cells. It can be seen that the false negative samples (black dots) and the true negative samples (red dots) were mixed. These t-SNE plots suggested that the GBM tissues may contain non-GBM cells and the non-GBM tissues may contain GBM cells, but most cells from the corresponding tissue were similar and the machine learning algorithm we used can get the robust single cell biomarkers even when there were tissue purity issues.

The Biological Functions of the Selected Genes

Upon analysis by the mRMR method, 31 important genes were extracted that may be essential biomarkers of GBM. We did Gene Ontology (GO) enrichment analysis of these 31 genes. The GO enrichment results were given in Table 3. It can be seen that their main function was cell adhesion and their main subcellular location was extracellular.

TABLE 3

GO termFDRP-valueGenes
GO:0007155 cell adhesion0.00688.26E−07EGR3, LGALS1, OMG, PTN, S100A10, CCL4, SPOCK1, TGFBI, CLDN10, MTSS1, PDZD2, P2RY12
GO:0022610 biological adhesion0.00688.74E−07EGR3, LGALS1, OMG, PTN, S100A10, CCL4, SPOCK1, TGFBI, CLDN10, MTSS1, PDZD2, P2RY12
GO:0031012 extracellular matrix0.00291.57E−06LGALS1, OMG, HTRA1, PTN, SPOCK1, TGFBI, VIM, SMOC1
GO:0005615 extracellular space0.01071.56E−05LGALS1, OMG, HTRA1, PTN, CCL3, CCL4, SPOCK1, TGFBI, TMSB4X, TPI1, NAMPT
GO:0005576 extracellular region0.01071.87E−05ATP1A2, LDHA, LGALS1, OMG, HTRA1, PTN, S100A10, CCL3, CCL4, SPOCK1, TGFBI, TMSB4X, TPI1, VIM, FCGBP, NAMPT, PDZD2, SMOC1, C1QL2
GO:0005578 proteinaceous extracellular matrix0.01072.30E−05LGALS1, OMG, PTN, SPOCK1, TGFBI, SMOC1
GO:0044421 extracellular region part0.01082.89E−05ATP1A2, LDHA, LGALS1, OMG, HTRA1, PTN, S100A10, CCL3, CCL4, SPOCK1, TGFBI, TMSB4X, TPI1, VIM, FCGBP, NAMPT, SMOC1

The GO enrichment results of the 31 selected genes.

We compared the 31 genes with reported GBM signatures in GeneSigDB (Culhane et al., 2012) and found that the 31 genes were significantly overlapped with a signature called “Human Glioblastoma_Morandi08_22genes” which were from Table 1 of Morandi et al. (2008): the 22 up-regulated genes following camptothecin (CPT) treatment in both U87-MG and DBTRG-05 cells. The hypergeometric test p-value was 0.0157.

Among the 31 genes, several of them plays roles in tumor metastasis. Thymosin β4 (TMSB4X/Tβ4) is associated with tumor metastasis and progression which plays a role in cell proliferation, migration, and differentiation through a TGFβ/MRTF Signaling Axis (Morita and Hayashi, 2018). TMSB4X expression was associated with cancers in a stage- and histology-specific manner and could be an effective prognostic parameter and prognostic index. Thus far, the relationship between TMSB4X and GBM remain unknown. IPCEF1 is the C-terminal half of CNK3 which is required for HGF-dependent Arf6 activation and migration during cancer metastasis (Attar et al., 2012). MTSS1 plays an important role in cancer metastasis. Previous researches indicated that MTSS1 as a potential tumor biomarker and its reduced expression associated with bad prognosis in many cancers. In GBM, MTSS1was reported as a potential tumor suppressor and prognostic biomarker which could suppress cell migration and invasion (Zhang and Qi, 2015).

Several genes can facilitate cancer progression. S100A10 is a calcium binding protein which is found to be significantly correlated with poor survival in patients with gliomas (Sethi et al., 2012). S100A10 has been involved in cancer progression, but the unique function is not well understood (O’Connell et al., 2010). HTRA1 encodes a ubiquitously expressed serine protease with prominent expression in the vasculature. Inhibition of HTRA1 could deregulate angiogenesis in the tumor stroma which plays an important role in tumor progression (Chien et al., 2006; He et al., 2010; Klose et al., 2018).

There are several other reported tumor genes. DHRS9 is a member of the short-chain dehydrogenases/reductases (SDR) family. Recent research found that SDR family members have been involved in tumors (Hu et al., 2016). TPI1 encodes an enzyme, consisting of two identical proteins, which catalyzes the isomerization of glyceraldehydes-3-phosphate (G3P) and dihydroxy-acetone phosphate (DHAP) in glycolysis and gluconeogenesis. TPI1 was down-regulated in response to LLL12 treatment and validated using immunoblot (Jain et al., 2015). It may serve as potential therapeutic targets in GBM (Jain et al., 2015).

Conclusion

Glioblastoma is the most aggressive and incurable primary brain cancer in adults. The most common survival time after diagnosis is 12–15 months, with 5-year survival rate <5%. Symptoms of GBM are non-specific at early stage and the cause of GBM remains elusive. We analysis the data from 2343 tumor cells and 1246 periphery cells using mRMR and IFS method to characterize infiltrating tumor cells, and to define the cellular diversity.

Statements

Data availability statement

The datasets generated for this study can be found in the https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE84465.

Author contributions

S-SF and QC conceived and designed the study. QC, JL, Z-YD, and S-SF performed the data mining and statistical analyses. FF, HC, and Z-YW prepared the figures and tables. QC and JL drafted the initial manuscript. S-SF made critical comments and revision for the initial manuscript. S-SF, QC, and JL had primary responsibility for the final content. All authors reviewed and approved the final manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (No. 81703622), the China Postdoctoral Science Foundation (No. 2018M633002), the Hunan Provincial Natural Science Foundation of China (No. 2018JJ3838), and the Hunan Provincial Health Committee Foundation of China (No. C2019186).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbioe.2020.00167/full#supplementary-material

TABLE S1

The top 100 mRMR genes.

References

  • 1

    AttarM. A.SalemJ. C.PurselH. S.SantyL. C. (2012). CNK3 and IPCEF1 produce a single protein that is required for HGF dependent Arf6 activation and migration.Exp. Cell Res.318228237. 10.1016/j.yexcr.2011.10.018

  • 2

    BajikarS. S.WangC. C.BortenM. A.PereiraE. J.AtkinsK. A.JanesK. A. (2017). Tumor-suppressor inactivation of GDF11 occurs by precursor sequestration in triple-negative breast cancer.Dev. Cell43418435.e13. 10.1016/j.devcel.2017.10.027

  • 3

    Cancer Genome Atlas Research Network (2008). Comprehensive genomic characterization defines human glioblastoma genes and core pathways.Nature45510611068. 10.1038/nature07385

  • 4

    ChienJ.AlettiG.BaldiA.CatalanoV.MurettoP.KeeneyG. L.et al (2006). Serine protease HtrA1 modulates chemotherapy-induced cytotoxicity.J. Clin. Invest.11619942004. 10.1172/JCI27698

  • 5

    ChinotO. L.WickW.MasonW.HenrikssonR.SaranF.NishikawaR.et al (2014). Bevacizumab plus radiotherapy-temozolomide for newly diagnosed glioblastoma.N. Engl. J. Med.370709722. 10.1056/NEJMoa1308345

  • 6

    CuiW.ChenL.HuangT.GaoQ.JiangM.ZhangN.et al (2013). Computationally identifying virulence factors based on KEGG pathways.Mol. Biosyst.914471452. 10.1039/c3mb70024k

  • 7

    CulhaneA. C.SchröderM. S.SultanaR.PicardS. C.MartinelliE. N.KellyC.et al (2012). GeneSigDB: a manually curated database and resource for analysis of gene expression signatures.Nucleic Acids Res.40D1060D1066. 10.1093/nar/gkr901

  • 8

    DarmanisS.SloanS. A.CrooteD.MignardiM.ChernikovaS.SamghababiP.et al (2017). Single-cell RNA-seq analysis of infiltrating neoplastic cells at the migrating front of human glioblastoma.Cell Rep.2113991410. 10.1016/j.celrep.2017.10.030

  • 9

    DasS.MarsdenP. A. (2013). Angiogenesis in glioblastoma.N. Engl. J. Med.36915611563. 10.1056/NEJMcibr1309402

  • 10

    DingC. H.DubchakI. (2001). Multi-class protein fold recognition using support vector machines and neural networks.Bioinformatics17349358. 10.1093/bioinformatics/17.4.349

  • 11

    FurnariF. B.FentonT.BachooR. M.MukasaA.StommelJ. M.SteghA.et al (2007). Malignant astrocytic glioma: genetics, biology, and paths to treatment.Genes Dev.2126832710. 10.1101/gad.1596707

  • 12

    GilbertM. R.DignamJ. J.ArmstrongT. S.WefelJ. S.BlumenthalD. T.VogelbaumM. A.et al (2014). A randomized trial of bevacizumab for newly diagnosed glioblastoma.N. Engl. J. Med.370699708. 10.1056/NEJMoa1308573

  • 13

    HeX.OtaT.LiuP.SuC.ChienJ.ShridharV. (2010). Downregulation of HtrA1 promotes resistance to anoikis and peritoneal dissemination of ovarian cancer cells.Cancer Res.7031093118. 10.1158/0008-5472.CAN-09-3557

  • 14

    HuL.ChenH. Y.HanT.YangG. Z.FengD.QiC. Y.et al (2016). Downregulation of DHRS9 expression in colorectal cancer tissues and its prognostic significance.Tumour Biol.37837845. 10.1007/s13277-015-3880-6

  • 15

    HuangT.CaiY.-D. (2013). An information-theoretic machine learning approach to expression QTL analysis.PLoS One8:e67899. 10.1371/journal.pone.0067899

  • 16

    HuangT.WangM.CaiY.-D. (2015). Analysis of the preferences for splice codes across tissues.Protein Cell6904907. 10.1007/s13238-015-0226-5

  • 17

    JainR.KulkarniP.DhaliS.RapoleS.SrivastavaS. (2015). Quantitative proteomic analysis of global effect of LLL12 on U87 cell’s proteome: an insight into the molecular mechanism of LLL12.J. Proteomics113127142. 10.1016/j.jprot.2014.09.020

  • 18

    JiangY.HuangT.ChenL.GaoY. F.CaiY.ChouK. C. (2013). Signal propagation in protein interaction network during colorectal cancer progression.Biomed Res. Int.2013:287019. 10.1155/2013/287019

  • 19

    KloseR.AdamM. G.WeisE. M.MollI.Wustehube-LauschJ.TetzlaffF.et al (2018). Inactivation of the serine protease HTRA1 inhibits tumor growth by deregulating angiogenesis.Oncogene3742604272. 10.1038/s41388-018-0258-4

  • 20

    LiB. Q.YouJ.HuangT.CaiY. D. (2014). Classification of non-small cell lung cancer based on copy number alterations.PLoS One9:e88300. 10.1371/journal.pone.0088300

  • 21

    LiZ.GuoJ.MaY.ZhangL.LinZ. (2018). Oncogenic role of MicroRNA-30b-5p in glioblastoma through targeting proline-rich transmembrane protein 2.Oncol. Res.26219230. 10.3727/096504017x14944585873659

  • 22

    LiuJ.AlbrechtA. M.NiX.YangJ.LiM. (2013). Glioblastoma tumor initiating cells: therapeutic strategies targeting apoptosis and microRNA pathways.Curr. Mol. Med.13352357. 10.2174/156652413805076830

  • 23

    LiuL.ChenL.ZhangY. H.WeiL.ChengS.KongX.et al (2017). Analysis and prediction of drug-drug interaction by minimum redundancy maximum relevance and incremental feature selection.J. Biomol. Struct. Dyn.35312329. 10.1080/07391102.2016.1138142

  • 24

    MorandiE.SeveriniC.QuercioliD.D’ArioG.PerdichizziS.CapriM.et al (2008). Gene expression time-series analysis of camptothecin effects in U87-MG and DBTRG-05 glioblastoma cell lines.Mol. Cancer7:66. 10.1186/1476-4598-7-66

  • 25

    MoritaT.HayashiK. (2018). Tumor progression is mediated by thymosin-beta4 through a TGFbeta/MRTF signaling axis.Mol. Cancer Res.16880893. 10.1158/1541-7786.MCR-17-0715

  • 26

    MullerS.KohanbashG.LiuS. J.AlvaradoB.CarreraD.BhaduriA.et al (2017). Single-cell profiling of human gliomas reveals macrophage ontogeny as a basis for regional differences in macrophage activation in the tumor microenvironment.Genome Biol.18:234. 10.1186/s13059-017-1362-4

  • 27

    NiuB.HuangG.ZhengL.WangX.ChenF.ZhangY.et al (2013). Prediction of substrate-enzyme-product interaction based on molecular descriptors and physicochemical properties.BioMed Res. Int.2013:674215. 10.1155/2013/674215

  • 28

    O’ConnellP. A.SuretteA. P.LiwskiR. S.SvenningssonP.WaismanD. M. (2010). S100A10 regulates plasminogen-dependent macrophage invasion.Blood11611361146. 10.1182/blood-2010-01-264754

  • 29

    OstromQ. T.GittlemanH.XuJ.KromerC.WolinskyY.KruchkoC.et al (2016). CBTRUS statistical report: primary brain and other central nervous system tumors diagnosed in the United States in 2009-2013.Neuro Oncol.18(Suppl. 5)v1v75. 10.1093/neuonc/now207

  • 30

    PatelA. P.TiroshI.TrombettaJ. J.ShalekA. K.GillespieS. M.WakimotoH.et al (2014). Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma.Science34413961401. 10.1126/science.1254257

  • 31

    PengH.LongF.DingC. (2005). Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.IEEE Trans. Pattern Anal. Mach. Intell.2712261238. 10.1109/tpami.2005.159

  • 32

    PuramS. V.TiroshI.ParikhA. S.PatelA. P.YizhakK.GillespieS.et al (2017). Single-cell transcriptomic analysis of primary and metastatic tumor ecosystems in head and neck cancer.Cell17116111624.e24. 10.1016/j.cell.2017.10.044

  • 33

    SethiM. K.BuettnerF. F.AshikovA.KrylovV. B.TakeuchiH.NifantievN. E.et al (2012). Molecular cloning of a xylosyltransferase that transfers the second xylose to O-glucosylated epidermal growth factor repeats of notch.J. Biol. Chem.28727392748. 10.1074/jbc.M111.302406

  • 34

    ShuY.ZhangN.KongX.HuangT.CaiY. D. (2014). Predicting A-to-I RNA editing by feature selection and random forest.PLoS One9:e110607. 10.1371/journal.pone.0110607

  • 35

    StegleO.TeichmannS. A.MarioniJ. C. (2015). Computational and analytical challenges in single-cell transcriptomics.Nat. Rev. Genet.16133145. 10.1038/nrg3833

  • 36

    StuppR.HegiM. E.GilbertM. R.ChakravartiA. (2007). Chemoradiotherapy in malignant glioma: standard of care and future directions.J. Clin. Oncol.2541274136. 10.1200/JCO.2007.11.8554

  • 37

    StuppR.MasonW. P.van den BentM. J.WellerM.FisherB.TaphoornM. J.et al (2005). Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma.N. Engl. J. Med.352987996. 10.1056/NEJMoa043330

  • 38

    StuppR.TaillibertS.KannerA.ReadW.SteinbergD. M.LhermitteB.et al (2017). Effect of tumor-treating fields plus maintenance temozolomide vs maintenance temozolomide alone on survival in patients with glioblastoma: a randomized clinical trial.JAMA31823062316. 10.1001/jama.2017.18718

  • 39

    SunL.YuY.HuangT.AnP.YuD.YuZ.et al (2012). Associations between ionomic profile and metabolic abnormalities in human population.PLoS One7:e38845. 10.1371/journal.pone.0038845

  • 40

    VerhaakR. G.HoadleyK. A.PurdomE.WangV.QiY.WilkersonM. D.et al (2010). Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1.Cancer Cell1798110. 10.1016/j.ccr.2009.12.020

  • 41

    XiaoS.YangZ.LvR.ZhaoJ.WuM.LiaoY.et al (2014). miR-135b contributes to the radioresistance by targeting GSK3beta in human glioblastoma multiforme cells.PLoS One9:e108810. 10.1371/journal.pone.0108810

  • 42

    YangJ.ChenL.KongX.HuangT.CaiY. D. (2014). Analysis of tumor suppressor genes based on gene ontology and the KEGG pathway.PLoS One9:e107202. 10.1371/journal.pone.0107202

  • 43

    ZhangG.-L.PanL.-L.HuangT.WangJ.-H. (2019). The transcriptome difference between colorectal tumor and normal tissues revealed by single-cell sequencing.J. Cancer1058835890. 10.7150/jca.32267

  • 44

    ZhangN.HuangT.CaiY. D. (2014). Discriminating between deleterious and neutral non-frameshifting indels based on protein interaction networks and hybrid properties.Mol. Genet. Genomics290343352. 10.1007/s00438-014-0922-5

  • 45

    ZhangN.WangM.ZhangP.HuangT. (2016). Classification of cancers based on copy number variation landscapes.Biochim. Biophys. Acta1860(11 Part B)27502755. 10.1016/j.bbagen.2016.06.003

  • 46

    ZhangP. W.ChenL.HuangT.ZhangN.KongX. Y.CaiY. D. (2015). Classifying ten types of major cancers based on reverse phase protein array profiles.PLoS One10:e0123147. 10.1371/journal.pone.0123147

  • 47

    ZhangS.QiQ. (2015). MTSS1 suppresses cell migration and invasion by targeting CTTN in glioblastoma.J. Neurooncol.121425431. 10.1007/s11060-014-1656-2

  • 48

    ZhaoT. H.JiangM.HuangT.LiB. Q.ZhangN.LiH. P.et al (2013). A novel method of predicting protein disordered regions based on sequence features.BioMed Res. Int.2013:414327. 10.1155/2013/414327

  • 49

    ZhaoX.GaoS.WuZ.KajigayaS.FengX.LiuQ.et al (2017). Single-cell RNA-seq reveals a distinct transcriptome signature of aneuploid hematopoietic cells.Blood13027622773. 10.1182/blood-2017-08-803353

  • 50

    ZhouY.ZhangN.LiB. Q.HuangT.CaiY. D.KongX. Y. (2015). A method to distinguish between lysine acetylation and lysine ubiquitination with feature selection and analysis.J. Biomol. Struct. Dyn.3324792490. 10.1080/07391102.2014.1001793

Summary

Keywords

glioblastoma biomarkers, scRNA-seq, mRMR method, support vector machine, pericarcinomatous environment

Citation

Cheng Q, Li J, Fan F, Cao H, Dai Z-Y, Wang Z-Y and Feng S-S (2020) Identification and Analysis of Glioblastoma Biomarkers Based on Single Cell Sequencing. Front. Bioeng. Biotechnol. 8:167. doi: 10.3389/fbioe.2020.00167

Received

19 December 2019

Accepted

19 February 2020

Published

05 March 2020

Volume

8 - 2020

Edited by

Peilin Jia, The University of Texas Health Science Center at Houston, United States

Reviewed by

Liang Lan, Hong Kong Baptist University, Hong Kong; Guohua Huang, Shaoyang University, China

Updates

Copyright

*Correspondence: Song-Shan Feng,

These authors have contributed equally to this work

This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Bioengineering and Biotechnology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics