Impact Factor 4.151
2017 JCR, Clarivate Analytics 2018

Frontiers journals are at the top of citation and impact metrics

This article is part of the Research Topic

Multi-“omics” in Neurodevelopmental Disorders

Original Research ARTICLE Provisionally accepted The full-text will be published soon. Notify me

Front. Genet. | doi: 10.3389/fgene.2019.00396

Integrative analysis of DiseaseLand Omics Database for Disease Signatures and Treatments: A Bipolar Case Study

 Chun Wu1,  Bevan E. Huang1, Guang Chen1, Tim Lovenberg1,  David Pocalyko1 and  Xiang Yao1*
  • 1Janssen Research and Development, United States

Transcriptomics technologies such as high-throughput next-generation sequencing and microarray platforms provide exciting opportunities for improving diagnosis and treatment of complex diseases. Transcriptomics studies often share similar hypotheses, but are carried out on different platforms, in different conditions, and with different analysis approaches. These factors, in addition to small sample sizes, can result in a lack of reproducibility. A clear understanding and unified picture of many complex diseases are still elusive, highlighting an urgent need to effectively integrate multiple transcriptomic studies for disease signatures. In collaboration with Omicsoft, we have integrated more than 3,000 high-quality transcriptomic datasets in oncology, immunology, neuroscience, cardiovascular and metabolic disease, and from both public and internal sources (DiseaseLand database). We established a systematic data integration and meta-analysis approach, which can be applied in multiple disease areas to create a unified picture of the disease signature and prioritize drug targets, pathways, and compounds. In this study, we provide an illustrative example using our approach to combine a total of 30 genome-wide gene expression studies for insight into the etiology of bipolar disorder. First, the studies were integrated by extracting raw FASTQ or CEL files, then undergoing the same procedures for preprocessing, normalization and statistical inference. Second, both p-value and effect size based meta-analysis algorithms were used to identify differentially expressed (DE) genes. The results revealed a total of 204 significantly DE (FDR < 0.05) genes in the prefrontal cortex. Among these were BDNF, VGF, WFS1, DUSP6, CRHBP, MAOA and RELN, which have previously been implicated in bipolar disorder. Finally, pathway enrichment analysis revealed a role for GPCR, MAPK, immune and Reelin pathways. The ability to robustly combine and synthesize the information from multiple studies enables a more powerful understanding of this complex disease.

Keywords: Meta-analysis, Transcriptomics, Bipolar Disorder, Drug Discovery, data integration, human brain

Received: 22 Dec 2018; Accepted: 11 Apr 2019.

Edited by:

Cristian Bonvicini, Centro San Giovanni di Dio Fatebenefratelli (IRCCS), Italy

Reviewed by:

Elizabeth A. Thomas, The Scripps Research Institute, United States
Sulev Kõks, University of Tartu, Estonia
Azmeraw T. AMARE, South Australian Health and Medical Research Institute (SAHMRI), Australia  

Copyright: © 2019 Wu, Huang, Chen, Lovenberg, Pocalyko and Yao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Dr. Xiang Yao, Janssen Research and Development, Spring House, California, United States,