Mini Review ARTICLE
Human protein interaction networks across tissues and diseases
- 1Department of Clinical Biochemistry and Pharmacology, Ben-Gurion University of the Negev, Beer-Sheva, Israel
- 2Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel
Protein interaction networks are an important framework for studying protein function, cellular processes, and genotype-to-phenotype relationships. While our view of the human interaction network is constantly expanding, less is known about networks that form in biologically important contexts such as within distinct tissues or in disease conditions. Here we review efforts to characterize these networks and to harness them to gain insights into the molecular mechanisms underlying human disease.
Protein molecules constitute the main building blocks of cells and mediate most cellular processes. In human, they are encoded by over 22,000 different genes, which give rise to many more proteins through alternative splicing mechanisms. These numerous proteins do not work in isolation: instead, they interact with each other and with other types of molecules to form complex cellular machines and to pass signals within cells and across tissues. In recognition of the fundamental role of these molecular interactions, much effort has been invested in the last two decades in their mapping. From small-scale experiments that measure interactions between a few proteins, mapping has changed to large-scale screens using high-throughput techniques such as yeast two-hybrid and co-immunoprecipitation (e.g., Rual et al., 2005; Stelzl et al., 2005; Ewing et al., 2007; Rolland et al., 2014). Owing to these mapping efforts, our current view of the physical interactions between human proteins encompasses over 200,000 interactions among over 20,000 proteins, and is continuously expanding. The resulting network of all known protein-protein interactions (PPIs), known as the human interactome, has become a key framework for studying protein function, cellular processes, and genotype-to-phenotype relationships, as reviewed elsewhere (Barabási et al., 2011; Vidal et al., 2011). However, this broad network is also limited. PPIs have rarely been measured in the context of distinct cell types, tissues, or in disease conditions, making it difficult to model and understand context-related phenotypes.
While knowledge of human context-specific PPIs is limited, we are witnessing a rapid accumulation of context-specific molecular expression profiles. The human body consists of tens of tissues, sub-tissues, and cell types that differ from one another in morphology and function. In a seminal study published more than a decade ago, Su et al. (2004) opened a window into their molecular characteristics by profiling the transcriptomes of 79 human tissues via DNA microarrays. Other studies profiled the transcriptomes of human tissues by techniques such as massively parallel signature sequencing (Jongeneel et al., 2005), expressed sequence tags (EST) (Hillier et al., 1996), and next generation RNA sequencing (e.g., Illumina's BodyMap 2.0). Most recent is the RNA sequencing of multiple human tissues from a number of individuals by the Genotype Tissue Expression project (Mele et al., 2015). The proteomes of human tissues have also been profiled by immunohistochemistry (Pontén et al., 2009; Uhlén et al., 2015) and mass-spectrometry techniques (Kim et al., 2014; Wilhelm et al., 2014). In addition to efforts to profile normal tissues, profiling techniques have also been employed to characterize different diseases. One of the more prominent initiatives is The Cancer Genome Atlas (TCGA) (Weinstein et al., 2013), which is actively mapping genomic, transcriptomic, proteomic, and epigenomic changes in cancerous tissues compared to normal tissues. These measurements shed light on the parts of the interactome that are active in these diverse contexts, although direct experimentation is required to reveal the actual PPI changes, in particular the formation of novel interactions (Ideker and Krogan, 2012). Below we discuss efforts to harness these context-specific molecular expression profiles to elucidate network properties of human tissues and to identify interaction-based disease mechanisms.
Features of Tissue and Cell-type Specific Networks
Given the lack of context-specific PPIs that were measured in different tissues and cell types, many studies revert to identifying PPIs that are feasible in these contexts. Their underlying assumption is that a PPI is feasible within a specific context if the corresponding proteins are expressed in that context. Of course not all feasible interactions actually take place, as they depend on many other factors such as localization and conformation of the two proteins, yet co-expression is necessary. Additionally, co-expression has often been based on RNA levels, as protein expression levels were rarely available. This approach had been used previously in model organisms to analyze their network dynamics in response to stimuli (Luscombe et al., 2004) or during cell cycle (de Lichtenberg et al., 2005), and has been used extensively for analyzing tissue interactomes (e.g., Lopes et al., 2011; Barshir et al., 2013; Song et al., 2014). Some differences in the sets of PPIs that are feasible within tissues and involve tissue-specific (TS) proteins and globally expressed (GE) “housekeeping” proteins are exemplified in Figure 1.
Figure 1. Feasible protein interactions change between tissues. All protein interactions (A) and feasible protein interactions that connect “global genes,” which are expressed in all three tissues, with tissue-specific genes that are expressed in one tissue out of adipose (B), or thyroid (C), or muscle (D). Data of the genes expressed per tissue were extracted from GTEx Portal (Mele et al., 2015) and limited to genes with 50 counts and above. Data of protein interactions were extracted using MyProteinNet (Basha et al., 2015) from BioGrid (Chatr-Aryamontri et al., 2015), DIP (Xenarios et al., 2002), IntAct (Kerrien et al., 2012), and MINT (Licata et al., 2012) databases. Only global genes that have tissue-specific interactions in each of the three tissues are shown.
One of the first questions that had been asked was whether genes and PPIs that appear to be TS or GE have distinct topological features relative to the generic human interactome or to each other. Dezso et al. (2008) complied transcriptome profiles of 31 tissues, and found that the set of GE genes was larger than previously assumed. They showed that the topology of the GE PPI network was characterized by higher connectivity and shorter paths between proteins relative to the generic interactome. Lin et al. (2009) analyzed the number of interactions (degree), closeness, and betweenness centralities of GE and TS proteins within the generic PPI network. They found that GE genes were more central and may form a core, while clusters of TS genes attach to the core at more peripheral positions in the network. Using the data of Su et al. (2004), Bossi and Lehner (2009) found extensive direct interactions between GE and TS proteins, and suggested a model for the evolution of TS functions through the modification of core cellular processes. Souiai et al. (2011) used EST data across 45 tissues to test whether tissue-specificity is encoded in the interactome. They also found that GE genes were located at the topological center of the interactome. Denoting interactions occurring at a subset of tissues as TS interactions (TSI), they found that TSI involved in regulatory and developmental functions were also central, whereas TSI involved in organ physiological functions were peripheral. Kiran et al. (Kiran and Nagarajaram, 2013) analyzed features of highly connected proteins, namely hubs, in tissue interactomes. They showed that, among other features, TS hubs were associated with a lower degree of interactome centrality as compared with GE hubs. Waldman et al. (2010) analyzed translation efficiency, and showed that genes that were translated more efficiently in a specific tissue encode proteins that tend to have more interactions in that tissue, relative to other proteins in the same tissue.
The application of RNA-sequencing to human tissues revealed that many more transcripts were expressed per tissue than previously acknowledged (Ramsköld et al., 2009). Emig and Albrecht (2011) were among the first to harness RNA-sequencing data to the analysis of tissue interactomes. They showed that, in contrast to previous studies based on microarray profiles, TSI were less common, and were mainly involved in transmembrane transport and receptor activation. They also suggested that a considerable part of tissue-specificity is likely to be achieved by alternative splicing and interactions involving protein isoforms (further discussed in Buljan et al., 2012). In accordance with this suggestion, Ellis et al. (2012) demonstrated experimentally that neural-regulated exons can remodel PPIs by stimulating and repressing different partner interactions. Another study showed that proteins enriched with splice variants tend to occupy central positions in tissue interactomes (Sinha and Nagarajaram, 2014). Recently, it was claimed that splicing play mostly a complementary role in driving cellular specificity, except for the brain, which exhibits a more divergent splicing program (Mele et al., 2015).
Another technological breakthrough that is taking place in recent years is the profiling of proteomes at large scale. Since the correlation between transcript and protein levels is partial (Schwanhausser et al., 2011), proteome profiling opens a more direct way to identify feasible PPIs. Liu et al. (2014) used proteomic data (Kim et al., 2014) to analyze tissue interactomes. They showed that, relative to the generic interactome, tissue interactomes are smaller, sparser, and that hubs may have more important roles. Barshir et al. (2013) combined transcript and protein measurements to create 16 extensive tissue interactomes. Their comparative analysis (Barshir et al., 2014) revealed that each tissue interactome is dominated by a core sub-network that is common to all tissues, with only a small fraction being TS. Most tissue hubs were GE and retained their large PPI degree across tissues, and were enriched in regulatory functions. Lastly, they found in each tissue a significant correlation between transcript expression level and number of PPIs involving the encoded protein.
An important application of tissue interactomes is to shed light on disease mechanisms. Lage et al. (2008) systematically mapped over 1000 heritable diseases to the tissues in which they manifest clinically by using text-mining. They showed that proteins and complexes that were linked to diseases tend to be over-expressed in the tissue where defects cause pathology, with the exception of proteins and complexes associated with cancers. Magger et al. (2012) showed that the usage of tissue interactomes, created from a generic interactome by removing or penalizing interactions involving non-expressed proteins, considerably improved the prioritization of disease genes. Li et al. (2014) assessed tissue interactomes weighted by DNA methylation data, and showed that they enhance prediction of disease genes. Barshir et al. (2014) focused on genes causing hereditary diseases and found that they tend to have PPIs that occur exclusively in the tissue where defects cause pathology. They demonstrated that these tissue-exclusive PPIs can highlight disease mechanisms, and, owing to their small number, suggested that they constitute an efficient filter for interrogating disease etiologies.
Perturbed Networks in Disease
Protein networks are perturbed in disease due to sequence mutations and expression changes. Zhong et al. (2009) were the first to systematically probe the effect of sequence (disease-causing) mutations on PPIs. They focused on known mutations causing Mendelian disorders and categorized them according to whether they have a truncation effect (“truncating,” including nonsense mutations, out-of-frame indels, or defective splicing) or not (“in-frame,” including missense mutations and in-frame indels). They showed that truncating mutations seem to lead to node-removal effects in the PPI network, while in-frame mutations are associated with edge-specific perturbations.
In a later study, Wang et al. (2012) examined the effect of disease-causing mutations using a structurally resolved PPI network, consisting of interactions and their atomic-resolution interfaces. They found that in-frame mutations tend to occur on the interaction interfaces of causal proteins and no similar enrichment was detected in non-interacting domains. This suggests that PPI perturbations play an important role in disease. Additionally, they found that the disease specificity for different mutations on the same gene can be explained by their location within the interface, further underscoring the importance of PPIs for the study of disease mechanisms.
On the technological side, Wei and Yu (Wei et al., 2014) developed an experimental pipeline to examine the consequences of different mutations on protein stability and interactions. They used the pipeline to show that disease causing mutations on interactions interfaces are more likely to perturb the corresponding interactions than mutations away from interfaces. Lambert et al. (2013) developed an experimental pipeline to score modulated interactions. The pipeline couples affinity purification to data-independent mass-spectrometric acquisition. The authors used it to identify interaction changes following disease-associated mutations and drug exposure.
Recently, Rolland et al. (2014) compared the impact of mutations associated with human disorders to that of common variants with no reported phenotypic consequences on PPIs. They focused on 32 genes with 115 disease and common variants, testing up to four disease and four common variants per disease gene for their impact on the ability of the corresponding proteins to interact with known interaction partners. They found that disease variants were 10-fold more likely to perturb interactions than common variants; more than 55% of the 107 interactions tested were perturbed by at least one disease-associated variant. In a follow-up study, Sahni et al. (2015) investigated the consequences of 2890 disease-causing missense mutations in 1140 genes. Out of 197 mutations covering 89 proteins with at least two PPI partners (in the HI-II-14 map of Rolland et al., 2014), 26% were found to cause a complete loss of interactions, 31% resulted in specific loss of some interactions, and 43% did not change the interaction partners. Disease mutations were shown to perturb interactions that are functionally relevant in the particular tissue affected by the specific disease. Sahni et al. further conclude that gain of interactions is a rare event in human disease, finding very little evidence for it.
The Road Ahead
Network biology in the past decade was focused on general networks per species, representing the interaction potential of every two proteins. It is becoming clear that these networks, while providing important insights, do not materialize in all conditions. Rather, different sub-networks are formed in different contexts depending on protein expression, structure, and more. In the future, when interactome measurements become as standard and inexpensive as genome sequencing, one can envision the construction of patient-specific networks that could dramatically improve our understanding of human disease and its treatment. With the accumulation of more individual-specific network data, statistical techniques that are currently limited to sequence data, such as association studies, could be generalized to the network world, ever refining our views of cells and organisms.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
We thank Idan Hekselman for his help in preparing the figure. This study was supported by the Israel Science Foundation administered by the Israel Academy of Sciences and Humanities through grant number 860/13 to EYL. RS was supported by a Naomi Kadar Award.
Barshir, R., Basha, O., Eluk, A., Smoly, I. Y., Lan, A., and Yeger-Lotem, E. (2013). The TissueNet database of human tissue protein-protein interactions. Nucleic Acids Res. 41, D841–D844. doi: 10.1093/nar/gks1198
Barshir, R., Shwartz, O., Smoly, I. Y., and Yeger-Lotem, E. (2014). Comparative analysis of human tissue interactomes reveals factors leading to tissue-specific manifestation of hereditary diseases. PLoS Comput. Biol. 10:e1003632. doi: 10.1371/journal.pcbi.1003632
Basha, O., Flom, D., Barshir, R., Smoly, I., Tirman, S., and Yeger-Lotem, E. (2015). MyProteinNet: build up-to-date protein interaction networks for organisms, tissues and user-defined contexts. Nucleic Acids Res. 43, W258–W263. doi: 10.1093/nar/gkv515
Buljan, M., Chalancon, G., Eustermann, S., Wagner, G. P., Fuxreiter, M., Bateman, A., et al. (2012). Tissue-specific splicing of disordered segments that embed binding motifs rewires protein interaction networks. Mol. Cell 46, 871–883. doi: 10.1016/j.molcel.2012.05.039
Chatr-Aryamontri, A., Breitkreutz, B. J., Oughtred, R., Boucher, L., Heinicke, S., Chen, D., et al. (2015). The BioGRID interaction database: 2015 update. Nucleic Acids Res. 43, D470–D478. doi: 10.1093/nar/gku1204
Dezso, Z., Nikolsky, Y., Sviridov, E., Shi, W., Serebriyskaya, T., Dosymbekov, D., et al. (2008). A comprehensive functional analysis of tissue specificity of human gene expression. BMC Biol. 6:49. doi: 10.1186/1741-7007-6-49
Ellis, J. D., Barrios-Rodiles, M., Colak, R., Irimia, M., Kim, T., Calarco, J. A., et al. (2012). Tissue-specific alternative splicing remodels protein-protein interaction networks. Mol. Cell 46, 884–892. doi: 10.1016/j.molcel.2012.05.037
Ewing, R. M., Chu, P., Elisma, F., Li, H., Taylor, P., Climie, S., et al. (2007). Large-scale mapping of human protein-protein interactions by mass spectrometry. Mol. Syst. Bio. 3, 89. doi: 10.1038/msb4100134
Hillier, L. D., Lennon, G., Becker, M., Bonaldo, M. F., Chiapelli, B., Chissoe, S., et al. (1996). Generation and analysis of 280,000 human expressed sequence tags. Genome Res. 6, 807–828. doi: 10.1101/gr.6.9.807
Jongeneel, C. V., Delorenzi, M., Iseli, C., Zhou, D., Haudenschild, C. D., Khrebtukova, I., et al. (2005). An atlas of human gene expression from massively parallel signature sequencing (MPSS). Genome Res. 15, 1007–1014. doi: 10.1101/gr.4041005
Kerrien, S., Aranda, B., Breuza, L., Bridge, A., Broackes-Carter, F., Chen, C., et al. (2012). The IntAct molecular interaction database in 2012. Nucleic Acids Res. 40, D841–D846. doi: 10.1093/nar/gkr1088
Lage, K., Hansen, N. T., Karlberg, E. O., Eklund, A. C., Roque, F. S., Donahoe, P. K., et al. (2008). A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes. Proc. Natl. Acad. Sci. U.S.A. 105, 20870–20875. doi: 10.1073/pnas.0810772105
Lambert, J. P., Ivosev, G., Couzens, A. L., Larsen, B., Taipale, M., Lin, Z. Y., et al. (2013). Mapping differential interactomes by affinity purification coupled with data-independent mass spectrometry acquisition. Nat. Methods 10, 1239–1245. doi: 10.1038/nmeth.2702
Li, M., Zhang, J., Liu, Q., Wang, J., and Wu, F. X. (2014). Prediction of disease-related genes based on weighted tissue-specific networks by using DNA methylation. BMC Med. Genomics 7(Suppl. 2):S4. doi: 10.1186/1755-8794-7-S2-S4
Licata, L., Briganti, L., Peluso, D., Perfetto, L., Iannuccelli, M., Galeota, E., et al. (2012). MINT, the molecular interaction database: 2012 update. Nucleic Acids Res. 40, D857–D861. doi: 10.1093/nar/gkr930
Lin, W. H., Liu, W. C., and Hwang, M. J. (2009). Topological and organizational properties of the products of house-keeping and tissue-specific genes in protein-protein interaction networks. BMC Syst. Biol. 3:32. doi: 10.1186/1752-0509-3-32
Lopes, T. J., Schaefer, M., Shoemaker, J., Matsuoka, Y., Fontaine, J. F., Neumann, G., et al. (2011). Tissue-specific subnetworks and characteristics of publicly available human protein interaction databases. Bioinformatics 27, 2414–2421. doi: 10.1093/bioinformatics/btr414
Luscombe, N. M., Babu, M. M., Yu, H., Snyder, M., Teichmann, S. A., and Gerstein, M. (2004). Genomic analysis of regulatory network dynamics reveals large topological changes. Nature 431, 308–312. doi: 10.1038/nature02782
Magger, O., Waldman, Y. Y., Ruppin, E., and Sharan, R. (2012). Enhancing the prioritization of disease-causing genes through tissue specific protein interaction networks. PLoS Comput. Biol. 8:e1002690. doi: 10.1371/journal.pcbi.1002690
Melé, M., Ferreira, P. G., Reverter, F., DeLuca, D. S., Monlong, J., Sammeth, M., et al. (2015). Human genomics. The human transcriptome across tissues and individuals. Science 348, 660–665. doi: 10.1126/science.aaa0355
Pontén, F., Gry, M., Fagerberg, L., Lundberg, E., Asplund, A., Berglund, L., et al. (2009). A global view of protein expression in human cells, tissues, and organs. Mol. Syst. Biol. 5, 337. doi: 10.1038/msb.2009.93
Ramsköld, D., Wang, E. T., Burge, C. B., and Sandberg, R. (2009). An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data. PLoS Comput. Biol. 5:e1000598. doi: 10.1371/journal.pcbi.1000598
Rolland, T., Tasan, M., Charloteaux, B., Pevzner, S. J., Zhong, Q., Sahni, N., et al. (2014). A proteome-scale map of the human interactome network. Cell 159, 1212–1226. doi: 10.1016/j.cell.2014.10.050
Rual, J. F., Venkatesan, K., Hao, T., Hirozane-Kishikawa, T., Dricot, A., Li, N., et al. (2005). Towards a proteome-scale map of the human protein-protein interaction network. Nature 437, 1173–1178. doi: 10.1038/nature04209
Sahni, N., Yi, S., Taipale, M., Fuxman Bass, J. I., Coulombe-Huntington, J., Yang, F., et al. (2015). Widespread macromolecular interaction perturbations in human genetic disorders. Cell 161, 647–660. doi: 10.1016/j.cell.2015.04.013
Sinha, A., and Nagarajaram, H. A. (2014). Nodes occupying central positions in human tissue specific PPI networks are enriched with many splice variants. Proteomics 14, 2242–2248. doi: 10.1002/pmic.201400249
Song, J., Wang, Z., and Ewing, R. M. (2014). Integrated analysis of the Wnt responsive proteome in human cells reveals diverse and cell-type specific networks. Mol. Biosyst. 10, 45–53. doi: 10.1039/C3MB70417C
Souiai, O., Becker, E., Prieto, C., Benkahla, A., De las Rivas, J., and Brun, C. (2011). Functional integrative levels in the human interactome recapitulate organ organization. PLoS ONE 6:e22051. doi: 10.1371/journal.pone.0022051
Stelzl, U., Worm, U., Lalowski, M., Haenig, C., Brembeck, F. H., Goehler, H., et al. (2005). A human protein-protein interaction network: a resource for annotating the proteome. Cell 122, 957–968. doi: 10.1016/j.cell.2005.08.029
Su, A. I., Wiltshire, T., Batalov, S., Lapp, H., Ching, K. A., Block, D., et al. (2004). A gene atlas of the mouse and human protein-encoding transcriptomes. Proc. Natl. Acad. Sci. U.S.A. 101, 6062–6067. doi: 10.1073/pnas.0400782101
Uhlén, M., Fagerberg, L., Hallström, B. M., Lindskog, C., Oksvold, P., Mardinoglu, A., et al. (2015). Proteomics. Tissue-based map of the human proteome. Science 347, 1260419. doi: 10.1126/science.1260419
Waldman, Y. Y., Tuller, T., Shlomi, T., Sharan, R., and Ruppin, E. (2010). Translation efficiency in humans: tissue specificity, global optimization and differences between developmental stages. Nucleic Acids Res. 38, 2964–2974. doi: 10.1093/nar/gkq009
Wang, X., Wei, X., Thijssen, B., Das, J., Lipkin, S. M., and Yu, H. (2012). Three-dimensional reconstruction of protein networks provides insight into human genetic disease. Nat. Biotechnol. 30, 159–164. doi: 10.1038/nbt.2106
Wei, X., Das, J., Fragoza, R., Liang, J., Bastos de Oliveira, F. M., Lee, H. R., et al. (2014). A massively parallel pipeline to clone DNA variants and examine molecular phenotypes of human disease mutations. PLoS Genet. 10:e1004819. doi: 10.1371/journal.pgen.1004819
Weinstein, J. N., Collisson, E. A., Mills, G. B., Shaw, K. R., Ozenberger, B. A., Ellrott, K., et al. (2013). The cancer genome atlas pan-cancer analysis project. Nat. Genet. 45, 1113–1120. doi: 10.1038/ng.2764
Wilhelm, M., Schlegl, J., Hahne, H., Moghaddas Gholami, A., Lieberenz, M., Savitski, M. M., et al. (2014). Mass-spectrometry-based draft of the human proteome. Nature 509, 582–587. doi: 10.1038/nature13319
Xenarios, I., Salwínski, L., Duan, X. J., Higney, P., Kim, S. M., and Eisenberg, D. (2002). DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions. Nucleic Acids Res. 30, 303–305. doi: 10.1093/nar/30.1.303
Keywords: protein interaction network, tissue-specific network, disease-specific network, network perturbation, gene expression
Citation: Yeger-Lotem E and Sharan R (2015) Human protein interaction networks across tissues and diseases. Front. Genet. 6:257. doi: 10.3389/fgene.2015.00257
Received: 28 May 2015; Accepted: 17 July 2015;
Published: 19 August 2015.
Edited by:Miguel Andrade, Johannes-Gutenberg University of Mainz, Germany
Reviewed by:Matteo Barberis, University of Amsterdam, Netherlands
Oksana Sorokina, The University of Edinburgh, UK
Copyright © 2015 Yeger-Lotem and Sharan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Esti Yeger-Lotem, Department of Clinical Biochemistry and Pharmacology, Ben-Gurion University of the Negev, PO Box 653, Beer-Sheva 84105, Israel, email@example.com;
Roded Sharan, Blavatnik School of Computer Science, Tel Aviv University, PO Box 39040, Tel Aviv 69978, Israel, firstname.lastname@example.org