We aim to find a biomarker that can effectively predict the prognosis of patients with cutaneous melanoma (CM). The RNA sequencing data of CM was downloaded from The Cancer Genome Atlas (TCGA) database and randomly divided into training group and test group. Survival statistical analysis and machine-learning approaches were performed on the RNA sequencing data of CM to develop a prognostic signature. Using univariable Cox proportional hazards regression, random survival forest algorithm, and receiver operating characteristic (ROC) in the training group, the four-mRNA signature including CD276, UQCRFS1, HAPLN3, and PIP4P1 was screened out. The four-mRNA signature could divide patients into low-risk and high-risk groups with different survival outcomes (log-rank p < 0.001). The predictive efficacy of the four-mRNA signature was confirmed in the test group, the whole TCGA group, and the independent GSE65904 (log-rank p < 0.05). The independence of the four-mRNA signature in prognostic prediction was demonstrated by multivariate Cox analysis. ROC and timeROC analyses showed that the efficiency of the signature in survival prediction was better than other clinical variables such as melanoma Clark level and tumor stage. This study highlights that the four-mRNA model could be used as a prognostic signature for CM patients with potential clinical application value.
Pathological neovascularization in choroid, a leading cause of blindness, is a characteristic of many fundus diseases, such as diabetic retinopathy and age-related macular degeneration. The present study aimed to elucidate the key signaling pathways in choroidal neovascularization (CNV) by analyzing the mRNA profiles of choroid and retina in tree shrews with CNV. We induced choroidal angiogenesis by laser photocoagulation in 15 tree shrews and obtained mRNA profiles of their choroids and retinas by high-throughput transcriptome sequencing. Hierarchical cluster analysis, weighted gene co-expression network analysis (WGCNA), protein-protein interaction (PPI) network analysis, hematoxylin and eosin (HE) staining, CD31 immunohistochemistry (IHC), and reverse transcription quantitative PCR (RT-qPCR) were performed. After laser photocoagulation, we obtained a total of 350 differentially expressed genes (DEGs) in the choroid, including 59 genes in Module-FASN (“ME-FASN”) module and 28 genes in Module-RPL (“ME-RPL”) module. A total of 69 DEGs in retina, including 20 genes in Module-SLC (“ME-SLC”) module. Bioinformatics analysis demonstrated that DEGs in choroid were mainly involved in membrane transport; DEGs in “ME-RPL” were prominent in pathways associated with IgA production, antigen presentation, and cell adhesion molecules (CAMs) signaling. DEGs in “ME-FASN” were involved in fatty acid metabolism and PPAR signaling pathway, while DEGs in “ME-SLC” were involved in GABAergic synapse, neuroactive life receptor interaction, cholinergic synapse, and retrograde endocannabinoid signaling pathway. PPI network analysis demonstrated that the ribosomal protein family genes (RPL31, RPL7, RPL26L1, and RPL19) are key factors of “ME-RPL,” acyl-CoA superfamily genes (ACACA, ACAT1, ACAA2, and ACACB) and FASN are key factors of “ME-FASN” and superfamily of solid carrier genes (SLC17A6, SLC32A1, SLC12A5, and SLC6A1) and complement genes (C4A, C3, and C2) are key factors of “ME-SLC.” In conclusion, the present study discovered the important signal transductions (fatty acid metabolic pathway and CAMs signaling) and genes (ribosomal protein family and the complement system) in tree shrew CNV. We consider that our findings hold implications in unraveling molecular mechanisms that underlie occurrence and development of CNV.
Background: The majority of chronic liver disease is caused by non-alcoholic fatty liver disease (NAFLD), which is one of the highly prevalent diseases worldwide. The current studies have found that non-coding RNA (ncRNA) plays an important role in the NAFLD, but few studies on circRNA. In this study, genes, microRNA (miRNA), and circular RNA (circRNA) associated with NAFLD were found by bioinformatic methods, bringing a novel perspective for the prevention and treatment of NAFLD.
Methods: Expression data of GSE63067 was acquired from Gene Expression Omnibus (GEO) database. The liver samples were collected from the people diagnosed with NAFLD or not. Differentially expressed genes (DEGs) were obtained from the steatosis vs. the control group and non-alcoholic steatohepatitis (NASH) vs. the control group using the GEO2R online tool. The overlapping genes remained for further functional enrichment analysis and protein-protein interaction network analysis. MiRNAs and circRNAs targeting these overlapping DEGs were predicted from the databases. Finally, the GSE134146 dataset was used to verify the expression of circRNA.
Results: In summary, 228 upregulated and 63 downregulated differential genes were selected. The top 10 biological processes and relative signaling pathways of the upregulated differential genes were obtained. Also, ten hub genes were performed in the Protein-protein interaction (PPI) network. One hundred thirty-nine miRNAs and 902 circRNAs were forecast for the differential genes by the database. Ultimately, the crosstalk between hsa_circ_0000313, miR-6512-3p, and PEG10 was constructed.
Conclusion: The crosstalk of hsa_circ_0000313-hsa-miR-6512-3p-PEG10 and some related non-coding RNAs may take part in NAFLD’s pathogenesis, which could be the potential biomarkers of NAFLD in the future.
A question of fundamental biological significance is to what extent the expression of a subset of genes can be used to recover the full transcriptome, with important implications for biological discovery and clinical application. To address this challenge, we propose two novel deep learning methods, PMI and GAIN-GTEx, for gene expression imputation. In order to increase the applicability of our approach, we leverage data from GTEx v8, a reference resource that has generated a comprehensive collection of transcriptomes from a diverse set of human tissues. We show that our approaches compare favorably to several standard and state-of-the-art imputation methods in terms of predictive performance and runtime in two case studies and two imputation scenarios. In comparison conducted on the protein-coding genes, PMI attains the highest performance in inductive imputation whereas GAIN-GTEx outperforms the other methods in in-place imputation. Furthermore, our results indicate strong generalization on RNA-Seq data from 3 cancer types across varying levels of missingness. Our work can facilitate a cost-effective integration of large-scale RNA biorepositories into genomic studies of disease, with high applicability across diverse tissue types.
Cancer is one of the most threatening diseases to humans. It can invade multiple significant organs, including lung, liver, stomach, pancreas, and even brain. The identification of cancer biomarkers is one of the most significant components of cancer studies as the foundation of clinical cancer diagnosis and related drug development. During the large-scale screening for cancer prevention and early diagnosis, obtaining cancer-related tissues is impossible. Thus, the identification of cancer-associated circulating biomarkers from liquid biopsy targeting has been proposed and has become the most important direction for research on clinical cancer diagnosis. Here, we analyzed pan-cancer extracellular microRNA profiles by using multiple machine-learning models. The extracellular microRNA profiles on 11 cancer types and non-cancer were first analyzed by Boruta to extract important microRNAs. Selected microRNAs were then evaluated by the Max-Relevance and Min-Redundancy feature selection method, resulting in a feature list, which were fed into the incremental feature selection method to identify candidate circulating extracellular microRNA for cancer recognition and classification. A series of quantitative classification rules was also established for such cancer classification, thereby providing a solid research foundation for further biomarker exploration and functional analyses of tumorigenesis at the level of circulating extracellular microRNA.
Background: Breast cancer (BC) is one of the most frequently diagnosed malignancies among females. As a huge heterogeneity of malignant tumor, it is important to seek reliable molecular biomarkers to carry out the stratification for patients with BC. We surveyed immune- associated lncRNAs that may be used as potential therapeutic targets in BC.
Methods: LncRNA expression data and clinical information of BC patients were downloaded from the TCGA database for a comprehensive analysis of candidate genes. A model consisting of immune-related lncRNAs enriched in BC cancerous tissues was established using the univariate Cox regression analysis and the iterative Lasso Cox regression analysis. The prognostic performance of this model was validated in two independent cohorts (GSE21653 and BC-KR), and compared with known prognostic biomarkers. A nomogram that integrated the immune-related lncRNA signature and clinicopathological factors was constructed to accurately assess the prognostic value of this signature. The correlation between the signature and immune cell infiltration in BC was also analyzed.
Results: The Kaplan-Meier analysis showed that the OS of Patients in the low-risk group had significantly better survival than those in the high-risk group, Clinical subgroup analysis showed that the predictive ability was independent of clinicopathological factors. Univariate/multivariate Cox regression analysis showed immune lncRNA signature is an important prognostic factor and an independent prognostic marker. In addition, GSEA and GSVA analysis as well as comprehensive analysis of immune cells showed that the signature was significantly correlated with the infiltration of immune cells.
Conclusion: We successfully constructed an immune-associated lncRNA signature that can accurately predict BC prognosis.