Historeceptomic Fingerprints for Drug-Like Compounds

Shmelkov, Evgeny; Grigoryan, Arsen; Swetnam, James; Xin, Junyang; Tivon, Doreen; Shmelkov, Sergey V.; Cardozo, Timothy

doi:10.3389/fphys.2015.00371

ORIGINAL RESEARCH article

Front. Physiol., 18 December 2015

Sec. Systems Biology Archive

Volume 6 - 2015 | https://doi.org/10.3389/fphys.2015.00371

Historeceptomic Fingerprints for Drug-Like Compounds

1. Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine New York, NY, USA
2. Google Inc., Mountain View CA, USA
3. GeneCentrix Inc. New York, NY, USA
4. Department of Neuroscience and Physiology, New York University School of Medicine New York, NY, USA
5. Department of Psychiatry, New York University School of Medicine New York, NY, USA

Abstract

Most drugs exert their beneficial and adverse effects through their combined action on several different molecular targets (polypharmacology). The true molecular fingerprint of the direct action of a drug has two components: the ensemble of all the receptors upon which a drug acts and their level of expression in organs/tissues. Conversely, the fingerprint of the adverse effects of a drug may derive from its action in bystander tissues. The ensemble of targets is almost always only partially known. Here we describe an approach improving upon and integrating both components: in silico identification of a more comprehensive ensemble of targets for any drug weighted by the expression of those receptors in relevant tissues. Our system combines more than 300,000 experimentally determined bioactivity values from the ChEMBL database and 4.2 billion molecular docking scores. We integrated these scores with gene expression data for human receptors across a panel of human tissues to produce drug-specific tissue-receptor (historeceptomics) scores. A statistical model was designed to identify significant scores, which define an improved fingerprint representing the unique activity of any drug. These multi-dimensional historeceptomic fingerprints describe, in a novel, intuitive, and easy to interpret style, the holistic, in vivo picture of the mechanism of any drug's action. Valuable applications in drug discovery and personalized medicine, including the identification of molecular signatures for drugs with polypharmacologic modes of action, detection of tissue-specific adverse effects of drugs, matching molecular signatures of a disease to drugs, target identification for bioactive compounds with unknown receptors, and hypothesis generation for drug/compound phenotypes may be enabled by this approach. The system has been deployed at drugable.org for access through a user-friendly web site.

Introduction

Enormous quantities of “omics” data characterizing both normal and diseased tissues continue to accumulate, leading to the development of increasingly complex molecular biomarkers for diseases. The majority of drugs in current clinical use were discovered by phenotypic screens, leaving their precise mechanism of action unknown. Many if not most of these drugs likely act polypharmacologically (on multiple receptors simultaneously). These two trends result in a growing knowledge gap between the efforts to mechanistically and genomically characterize diseases on the molecular level and the chemicals used for their treatment (Figure 1).

Figure 1

Polypharmacology partly addresses this gap and has gained increasing attention in the field of drug discovery (Peters, 2011). At least some approved drugs exhibit polypharmacological signatures by interacting with multiple targets(Ashburn and Thor, 2004; Keiser et al., 2007, 2009; Mestres et al., 2009; Durrant et al., 2010; Yang et al., 2011). The identification of more of the ensemble of these targets is essential for both understanding the mechanism of drug action and predicting toxicity (Cereto-Massagué et al., 2015). Moreover, the development of compounds that rationally interact with multiple targets is appealing in the case of complex multigenic diseases, such as cancer (Knight et al., 2010) or psychiatric disorders (Metz and Hajduk, 2010; Allen and Roth, 2011; Brown and Okuno, 2012). Improved polypharmacological profiles of a drug can be identified only by a more comprehensive analysis of drug-target interactions on a proteome-wide scale (Xie et al., 2012).

In recent years, growing databases of compound-receptor bioactivities have become available (Wang et al., 2009; Sharman et al., 2011; Gaulton et al., 2012). However, the complete universe of bioactivity scores between putative or actual drugs/compounds and their receptors is still far from approachable. A number of ligand-based and structure-based in silico approaches emerged to address the off-target identification aspect of this issue (Rognan, 2013). Ligand-based approaches are based on an assumption that chemically similar structures are more likely to have similar pharmacological profiles. The idea behind the structure-based off-target identification approaches is based on inverse docking (Chen and Zhi, 2001), where a single compound is docked to multiple targets and the potential biological targets are ranked based on the docking (Chen and Zhi, 2001; Paul et al., 2004; Gao et al., 2008; Yang et al., 2009; Durrant et al., 2010; Li et al., 2010a,b; Grinter et al., 2011).

The combination of in silico target identification methods and growing databases of experimental bioactivity scores improves the feasibility of using these methods to identify a significant subset of the complete ensemble of receptors for known drugs and drug-like compounds by computational approaches. However, a gap would still remain between the polypharmacology of a drug and its pharmacodynamics, i.e., the distribution of its receptor targets in the human body. In order for the affinity of a drug for a given receptor in a given tissue to be a significant factor, the receptor has to be expressed in this tissue. For example, no matter how high the affinity of LSD is for the serotonin 5-HT2a receptor (HTR2A), this drug-target interaction is not physiologically significant in uterine tissue as HTR2A is not expressed there. The true fingerprint of drug action is the totality (“omics”) of receptors for which a drug has affinity, weighted by the expression levels of these receptors in the tissues (“histos”) across human body. Hence we introduced the term “historeceptomic fingerprint” for the holistic signature of drug action. Thus, here, we aim to develop a novel approach for the identification of historeceptomic fingerprints for any given drug/compound.

Methods

Chemical library

Chemical structures in Drugable were obtained from three sources: DrugBank, PubChem, and ChEMBL. 1423 approved and 4752 experimental drugs were imported from DrugBank 2.5 via the XML format release. An additional 1,138,288 compounds were imported from the SDF format release of ChEMBL 14. Additionally, PubChem compound identifiers from the SDF release were assigned to 1,006,895 DrugBank or ChEMBL compounds in Drugable on the basis of equal canonical SMILES strings as computed from RDKit (Landrum, 2008). Overall 1,141,434 unique chemical structures are represented in Drugable.

Compound-compound associations

Compound-compound associations were evaluated as a chemical similarity measure between two compounds and derived as Tanimoto distance between their molecular fingerprints as implemented in the RDKit PostgreSQL extension. Briefly, given a molecule, all linear and non-linear fragments of different size were enumerated and hashed into a bit string called a fingerprint. The Tanimoto coefficient, T, for two fingerprints was calculated as the number of bits in which they differ divided by the number of non-zero bits they have in common. The Tanimoto distance was defined as 1—T. Compounds are shown in the “Similar Compounds” section of a compound page if their Tanimoto distance is less than 0.5.

Protein library

20,266 Human proteins were imported from the XML release of UniProt into drugable.org.

Structure library

3D Structures for the human proteins imported as above were obtained from two sources, the Pocketome (Abagyan and Kufareva, 2009) and ModBase (Pieper et al., 2006). 6857 experimental structures come from the Pocketome and 64,801 homology models are available from ModBase.

Consideration of receptor flexibility is crucial for structure-based drug design and the conformational ensembles of protein receptors derived from Pocketome are a practical alternative to mimic receptor flexibility. However, blindly adding certain conformations to an ensemble may be counterproductive (Rueda et al., 2010). To ensure the high quality of selected conformers, we performed retrospective virtual screening experiments and only structures with high separation power of known ligand binders from decoys were selected. Initially, for a benchmark screen, pockets on Pocketome human proteins (Table 1) were screened against a custom chemical library consisting of compounds solved crystallographically with several proteins and 100 random chemical decoys in order to measure the docking quality of the pockets. Having established that only the highest quality pockets could produce accurate docking scores, a subset of 6857 high quality X-ray conformations of 570 human protein targets from Pocketome was imported into the data warehouse. The 4.2 billion scores generated for Pubchem Bioassay, ChEMBL, and DrugBank compounds against these 6857 high quality pockets on 570 protein targets from the Pocketome have been integrated into the drugable.org historeceptomics system. Where there are multiple conformations for a pocket, the best score was retained. An additional complete matrix of docking scores of 4313 unique chemotypes from drugbank against ModBase homology model database is available in raw form from the authors. As a complete matrix, this data can be used for routine mathematical transformations to study symmetries and trends in the data that relate to polypharmacology. In all, docking to the largest possible set of pockets representing the druggable human genome was evaluated in this study.

Table 1

Data set	No. of pockets	Mean AUC
All	7553	0.57
All TP ≥ 5 and TN ≥ 5	6017	0.569
Homology only and TP ≥ 5 and TN ≥ 5	2128	0.528
Pocketome only and TP ≥ 5 and TN ≥ 5	3889	0.591

Assessment of docking performance.

TP and TN are the numbers of positive and negative bioactivity values available for a given pocket on a protein. Since estimation of AUC for pockets with a very small number of bioactivity values may not be fair, we also provide estimates obtained on pockets with at least 5 positive and 5 negative bioactivity values.

Pharmareceptomic (bioactivity or docking) scores

In order to score the probability of interaction of compounds to a comprehensive set of protein targets, we used the largest available set of experimentally obtained bioactivities and in silico predicted compound-protein docking associations.

Source of in vitro binding data

1,062,908 experimental compound-protein binding affinity measurements were downloaded from ChEMBL 14 PostgreSQL release. We used only binding measurements annotated with a confidence score ≥7, “assay type” field of “B,” or direct protein-ligand binding, and “standard_type” field of “Kd,” “Ki,” or “Potency.” All compound-protein associations obtained from ChEMBL are linked to their original scientific publications in PubMed where data was available from ChEMBL.

Source of in silico docking data

More than four billion compound-protein associations were derived from in silico docking experiments. The AutoDock docking program was used for the docking calculations and all the parameters were set to default values. AutoDock addresses the docking issue as a global optimization problem of an energy function, implementing an iterated local search global optimizer, using the Broyden-Fletcher-Goldfarb-Shanno criterion for local search (Trott and Olson, 2010).

Target Structure Preparation: The approach is intended to be proteome wide. Therefore, many targets with unknown biological function are expected to be available from structural genomics efforts for this approach. In order to simulate the realistic situation wherein the specific functional site on a new crystallographically resolved target receptor with unknown biological function is unknown, we rendered pockets on all receptors blindly based only on the structure coordinates and randomly selected one pocket per receptor. This pocket was then defined as the binding site for docking. Receptors were then set-up by deleting the chains, heteroatoms, and prosthetic groups not involved in the binding site definition using ICM Browser (Molsoft LLC, La Jolla CA). Protein atom types were assigned, and hydrogen atoms and missing heavy atoms were added. The added or zero occupancy side chains and polar hydrogen atoms were optimized and assigned the lowest energy. Tautomeric states of histidines and the rotations of asparagine and glutamine side chain amidic groups were optimized to improve the hydrogen-bonding patterns. The cognate ligands were deleted from the complexes only after hydrogen optimization. Following this receptor preparation, we used the prepare_receptor4.py script (a part of the AutoDock Tools distribution) with default settings to convert the PDB models produced by ICM to the native PDBQT format of AutoDock.

Ligand Structure Preparation: For each compound, bond orders, tautomeric forms, stereochemistry, hydrogen atoms, and protonation states were assigned automatically by the AutoDock chemical conversion procedure. Each ligand was assigned the modified X-Score force field atom types and charges implemented in Arg. Canonical SMILES of each ligand to be screened were matched to the appropriate PubChem 3D structure (Bolton et al., 2011) to be used as a starting conformation for AutoDock docking.

After each docking simulation a stack of diverse binding poses was generated, and their respective docking scores were evaluated using the AutoDock scoring function (Trott and Olson, 2010). Three docking runs were performed for each compound-pocket pair; all binding poses accumulated after each run were merged in a single conformational stack and ranked based on their binding scores; finally, the conformation with the best docking score was retained.

Predicted pharmareceptomics score (probability) of compound-target interaction

In our approach, the pharmareceptomics score is equivalent to the estimated probability that the compound will interact with the target at a physiologically significant level. For experimental bioactivities, the pharmareceptomics score is set equal to experimental affinity. For docking scores, we used the relationship between binding affinity and docking score published in Husby et al. (2015) to estimate a pharmareceptomics score from a docking score.

Protein target–gene expression associations

Gene expression patterns of protein targets from a diverse set of tissues and cell types were derived from the “GeneAtlas U133A, gcrma” dataset (Su et al., 2004) via the BioGPS web-tool (http://biogps.org/, accessed on 5/7/2013; Wu et al., 2009, 2013). If for a given gene, data from multiple probes/experiments were available, the mean of those values was used. For each target protein, the level of expression in each tissue was normalized with regard to its level of expression in all tissues of the dataset and projected into the Z-score.

Data access

The system (“Drugable”) is accessible via user-friendly interface at http://drugable.org/. A flexible free-text search index is available for common names of compounds and targets, medical conditions, etc. Chemical drawer allows user to search by chemical similarity or substructure.

For example when searching Drugable by compound common name, the user is presented with compound chemical structure, compound information (Number of Hydrogen Bond Donors and Hydrogen Bond Acceptors, Number of Rotatable Bonds, Number of Rings, Walden-Crippen LogP, Indication, Pharmacology, Mechanism of Action etc.), and a table of compound-protein associations (experimentally derived and/or predicted by in silico docking experiments) available for this specific compound. The resulting table gives a list of protein targets of the compound of interest with reported or predicted affinity, including protein target UniProt accession ID, the measured activity value and type or docking score. Note that all the experimentally obtained activities are displayed in nM. In addition, a list of compounds that are chemically similar to the compound of interest is also presented. Furthermore, tissue-specific levels of expression for all genes, correspond to the protein targets of the compound of interest, are presented as a heat map.

Alternatively, a user may want to search for a particular protein of interest. In this case, the user is presented with details of the protein target, such as X-ray structure (if available), protein name synonyms, gene names, organism this protein belongs to, and UniProt accession ID.

Furthermore, users may search for a medical condition of interest. In this case user is presented with a list of drugs/drug-like compounds as well as protein targets associated with this medical condition.

Results

Generation of bioactivity scores

First, we generated bioactivity probability scores for the compound-receptor pairs by executing the largest computational molecular docking reported to date (see Section Methods). A benchmark docking screen was performed against 3D structural models of human proteins (Table 1). The mean area under the receiver operating curve (AUC) for benchmark docking was 0.59 (with about 23% of structures having separation power above 0.7) when performed on 3D structural models from Pocketome (Kufareva et al., 2012), but only 0.53 (with 8.5% of structures above AUC of 0.7) on ModBase (Pieper et al., 2006) homology models proteins (Table 2 and Supplementary Table 1). This result suggests that only the docking scores achieved with the highest quality Pocketome pockets should be included in our “omics” set of compound-receptor scores, which are used to predict mechanistic signatures solely from chemotype. The Pocketome currently includes 6857 pockets derived from high quality crystallographic structures of 570 target human proteins. Therefore, for our “omics” set we docked over 600,000 unique non-overlapping chemical structures from PubChem Bioassay, ChEMBL, and DrugBank against these 6857 pockets for a total of 4.2 billion pairwise docking scores between compounds and targets. These “omics” in silico docking scores together with the compound–receptor affinities obtained experimentally, constitute the bioactivity scores data set (Figure 2), which comprise a significant fraction of the druggable targets encoded in the human genome, by one estimate to be around 4000 targets (Reardon, 2013).

Table 2

AUC threshold	Source	No. of receptors	% of receptors
0.9	All	77	1.3
	Homology	20	0.9
	Pocketome	57	1.5
0.8	All	389	6.5
	Homology	55	2.6
	Pocketome	334	8.6
0.7	All	1090	18.1
	Homology	180	8.5
	Pocketome	910	23.4
0.6	All	2575	42.8
	Homology	551	25.9
	Pocketome	2024	52.0

Number of receptors from the benchmark study with AUC above a certain threshold.

Figure 2

Generation of historeceptomic scores

To address the issue of physiological significance of drug targets detected in the first step, we endeavored to calculate a tissue-specific (historeceptomic) compound-receptor score (Figure 3A). Tissue-specific gene expression data on protein targets were obtained from the BioGPS database. The level of expression of each receptor in every tissue was normalized with regard to its expression level in all tissues of the dataset by calculating its standard score (Z-score, see Section Methods). Each compound-receptor association in each tissue was scored by integrating their bioactivity with the receptor expression in a given tissue as follows:

where Hs is a historeceptomic score, Ps is a bioactivity score, and Z is the gene-expression Z-score.

Figure 3

By this method, for any given drug/compound, thousands of historeceptomic scores can be generated, but only a tiny fraction of these, which measure the probability that the compound will affect the receptor in a physiologically significant way, are important. The average drug may have hundreds of low affinity receptors, resulting in a set of scores numbering in the tens of thousands across all tissues in the human body. To identify the physiologically significant compound-receptor interactions out of the large number of all on-/off-target interactions of a given compound, we used the generalized extreme Studentized deviate test as a statistical novelty detection approach using the α = 0.0001 level of significance (Figure 3B). Statistically significant historeceptomic scores of a given drug/compound form its historeceptomic fingerprint.

Fingerprints were pre-calculated for all known drugs into an integrated system suitable for searching with any chemical structure to find its historeceptomic fingerprint. The system includes the 4.2 billion docking scores with experimental affinity scores in a graph linking drugs/compounds to protein targets in order to maximize the sensitivity of target detection for any drug.

Illustrative use case

Historeceptomics fingerprints may specifically localize in vivo significant mechanisms of action of a polypharmacologic drug, translating purely molecular data into a clinically interpretable profile. An example is shown in Figure 3A. Lysergic acid diethylamide (LSD) is a hallucinogenic drug in humans, which makes it difficult to study in animal models, as many hallucinations are only represented internally and can only be communicated verbally. We calculated the historeceptomics profile for LSD. In this case, the inputs into our system were only molecular in nature: the affinity scores and the expression data. We did not use docking in this example. Our historeceptomics approach identified the 5HT2A receptor in the prefrontal cortex (PFC) as the most significant of tissue-target pair associated with the phenotype induced by LSD. Independently, we analyzed the preclinical and clinical literature on LSD targets, which is exclusively non-molecular data. The textbook and literature consensus from animal neuroperturbation studies, pharmacologic studies and clinical neuroimaging is that 5HT2A is the primary molecular target of LSD, and that, specifically, its activity in the PFC is responsible for its effects. Thus, there are many non-molecular clinical and translational papers in the literature, none of which were input to our system, that clearly establish 5HT2A specifically in the PFC not only as a key pathway for LSD psychosis, but also as the epicenter of the very similar psychoses seen in human schizophrenia (Arvanov et al., 1999; Vollenweider and Geyer, 2001; Muschamp et al., 2004; Nichols, 2004). The historeceptomics approach predicts this finding independently of animal or clinical studies.

Discussion

This report takes on the two major challenges of precisely describing the holistic pharmacodynamics of drugs. First, we expanded the graph of experimental scores linking drugs/compounds to protein targets, which has been used in prior methods such as SEA (Keiser et al., 2007), to include the data from the largest computational molecular docking of compounds to protein pockets yet reported. This should increase the sensitivity of target detection. Second, we addressed, for the first time, the systematic integration of bioactivity/docking scores between drugs/compounds and proteins with the expression patterns of those proteins in human tissues, thus mapping the pharmacology of drugs into human physiologic space.

The integration of bioactivity/docking scores of compound-receptors with the expression patterns of those receptors in human tissues increases the specificity of the results by eliminating noise and selecting only physiologically significant drug-target interactions. Thus, although for many models/pockets the docking scores correlate only moderately with affinity due to the limited ability to take induced fit into account, this lack of specificity is abrogated by our integration of the gene expression such that many false positives are likely to be culled. While sensitivity is low, it can be steadily improved from our pioneering prototype by (1) improved binding site (pocket) selection methods and (2) natural growth and improved curation of the crystallographic and bioactivity databases.

There are 20,198 reviewed human proteins in UniProt, of which 4300 have human crystal structures in the PDB (21.3% of total). An additional 20–30% of these can likely be modeled reliably by homology. Thus, up to 50% of the “proteome” might already be surveyed by docking. Estimates of the druggable genome range from 8 to 12 thousand targets. The existing structures are probably highly enriched in these targets so, one can speculate that 40–50% of the druggable genome is already accessible by docking. These are highly speculative estimates, but since the number of crystal structures and the power of computation is growing rapidly, it is not unreasonable to speculate that a low resolution representation of the majority of the druggable genome could be available for docking soon.

The system has been deployed for access through a user-friendly web site: drugable.org. For compounds resulting from phenotype screens, where their mechanism of action is not known, searching the site can identify possible mechanisms of action. Similarly, where the tissue pattern of a disease is known, drug activity detected by our approach in tissues not included in the pattern could be suggestive of the mechanism of the adverse effects of a drug. Since the historeceptomic fingerprints contain both a specific pattern of targets and a specific pattern of tissues, they could potentially be matched to complex biomarkers of disease derived from exhaustive molecular profiling, which can have a similar gene-tissue signature. Our novel approach thus potentially fills a currently existing gap between burgeoning “omics” data and drugs/drug-like compounds (Figure 1).

Funding

This work was supported by an American Recovery and Reinvestment Act grant from the National Library of Medicine (RC LM010994 to TC). Additional support was provided by Google, Inc. via their Exacycle for Visiting Faculty award program (to JS and TC).

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. Timothy Cardozo and Sergey V. Shmelkov are shareholders in Genecentrix Inc., a company founded after this study was completed, but based, in part, on this research. No funding was received from Genecentrix Inc. for the present study.

Statements

Author contributions

ES, AG, JS designed the study, performed experiments, analyzed data, and assisted in writing the manuscript; JX and DT performed experiments and analyzed data; SS. designed the study, analyzed data, and wrote the manuscript; TC conceived and designed the study, analyzed data, and wrote the manuscript.

Acknowledgments

We thank Ruben Abagyan, Maxim Totrov, Yuval Kluger, Fabio Parisi and Francesco Strini for helpful discussions. We thank the Visiting Faculty for Exacycle program for support of the work of JS and Daniel Belov, Chris van Arsdale, David Konerding and Daniel Meredith of Google Inc. for technical assistance.

Conflict of interest

Supplementary material

The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fphys.2015.00371

References

1
AbagyanR.KufarevaI. (2009). The flexible pocketome engine for structural chemogenomics. Methods Mol. Biol.575, 249–279. 10.1007/978-1-60761-274-2_11
2
AllenJ. A.RothB. L. (2011). Strategies to discover unexpected targets for drugs active at G protein-coupled receptors. Annu. Rev. Pharmacol. Toxicol.51, 117–144. 10.1146/annurev-pharmtox-010510-100553
3
ArvanovV. L.LiangX. F.RussoA.WangR. Y. (1999). LSD and DOB: interaction with 5-HT2A receptors to inhibit NMDA receptor-mediated transmission in the rat prefrontal cortex. Eur. J. Neurosci.11, 3064–3072. 10.1046/j.1460-9568.1999.00726.x
4
AshburnT. T.ThorK. B. (2004). Drug repositioning: identifying and developing new uses for existing drugs. Nat. Rev. Drug Discov.3, 673–683. 10.1038/nrd1468
5
BoltonE. E.ChenJ.KimS.HanL.HeS.ShiW.et al. (2011). PubChem3D: a new resource for scientists. J. Cheminform.3:32. 10.1186/1758-2946-3-32
6
BrownJ. B.OkunoY. (2012). Systems biology and systems chemistry: new directions for drug discovery. Chem. Biol.19, 23–28. 10.1016/j.chembiol.2011.12.012
7
Cereto-MassaguéA.OjedaM. J.VallsC.MuleroM.PujadasG.Garcia-VallveS. (2015). Tools for in silico target fishing. Methods71, 98–103. 10.1016/j.ymeth.2014.09.006
8
ChenY. Z.ZhiD. G. (2001). Ligand-protein inverse docking and its potential use in the computer search of protein targets of a small molecule. Proteins43, 217–226. 10.1002/1097-0134(20010501)43:2<217::AID-PROT1032>3.0.CO;2-G
9
DurrantJ. D.AmaroR. E.XieL.UrbaniakM. D.FergusonM. A.HaapalainenA.et al. (2010). A multidimensional strategy to detect polypharmacological targets in the absence of structural and sequence homology. PLoS Comput. Biol.6:e1000648. 10.1371/journal.pcbi.1000648
10
GaoZ.LiH.ZhangH.LiuX.KangL.LuoX.et al. (2008). PDTD: a web-accessible protein database for drug target identification. BMC bioinformatics9:104. 10.1186/1471-2105-9-104
11
GaultonA.BellisL. J.BentoA. P.ChambersJ.DaviesM.HerseyA.et al. (2012). ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res.40, D1100–D1107. 10.1093/nar/gkr777
12
GrinterS. Z.LiangY.HuangS. Y.HyderS. M.ZouX. (2011). An inverse docking approach for identifying new potential anti-cancer targets. J. Mol. Graph. Model.29, 795–799. 10.1016/j.jmgm.2011.01.002
13
HusbyJ.BottegoniG.KufarevaI.AbagyanR.CavalliA. (2015). Structure-based predictions of activity cliffs. J. Chem. Inf. Model.55, 1062–1076. 10.1021/ci500742b
14
KeiserM. J.RothB. L.ArmbrusterB. N.ErnsbergerP.IrwinJ. J.ShoichetB. K. (2007). Relating protein pharmacology by ligand chemistry. Nat. Biotechnol.25, 197–206. 10.1038/nbt1284
15
KeiserM. J.SetolaV.IrwinJ. J.LaggnerC.AbbasA. I.HufeisenS. J.et al. (2009). Predicting new molecular targets for known drugs. Nature462, 175–181. 10.1038/nature08506
16
KnightZ. A.LinH.ShokatK. M. (2010). Targeting the cancer kinome through polypharmacology. Nat. Rev. Cancer10, 130–137. 10.1038/nrc2787
17
KufarevaI.IlatovskiyA. V.AbagyanR. (2012). Pocketome: an encyclopedia of small-molecule binding sites in 4D. Nucleic Acids Res.40, D535–D540. 10.1093/nar/gkr825
18
LandrumG. (2008). RDKit: Open-source Cheminformatics. Available online at: http://www.rdkit.org
- Google Scholar
19
LiL.Bum-ErdeneK.BaenzigerP. H.RosenJ. J.HemmertJ. R.NellisJ. A.et al. (2010a). BioDrugScreen: a computational drug design resource for ranking molecules docked to the human proteome. Nucleic Acids Res.38, D765–D773. 10.1093/nar/gkp852
20
LiL.LiJ.KhannaM.JoI.BairdJ. P.MerouehS. O. (2010b). Docking small molecules to predicted off-targets of the cancer drug erlotinib leads to inhibitors of lung cancer cell proliferation with suitable in vitro pharmacokinetic properties. ACS Med. Chem. Lett.1, 229–233. 10.1021/ml100031a
21
MestresJ.Gregori-PuigjanéE.ValverdeS.SoléR. V. (2009). The topology of drug-target interaction networks: implicit dependence on drug properties and target families. Mol. Biosyst.5, 1051–1057. 10.1039/b905821b
22
MetzJ. T.HajdukP. J. (2010). Rational approaches to targeted polypharmacology: creating and navigating protein-ligand interaction networks. Curr. Opin. Chem. Biol.14, 498–504. 10.1016/j.cbpa.2010.06.166
23
MuschampJ. W.ReginaM. J.HullE. M.WinterJ. C.RabinR. A. (2004). Lysergic acid diethylamide and [-]-2,5-dimethoxy-4-methylamphetamine increase extracellular glutamate in rat prefrontal cortex. Brain Res.1023, 134–140. 10.1016/j.brainres.2004.07.044
24
NicholsD. E. (2004). Hallucinogens. Pharmacol. Ther.101, 131–181. 10.1016/j.pharmthera.2003.11.002
25
PaulN.KellenbergerE.BretG.MüllerP.RognanD. (2004). Recovering the true targets of specific ligands by virtual screening of the protein data bank. Proteins54, 671–680. 10.1002/prot.10625
26
PetersJ. U. (2011). Polypharmacology in Drug Discovery.Hoboken, NJ: John Wiley & Sons, Inc.
- Google Scholar
27
PieperU.EswarN.DavisF. P.BrabergH.MadhusudhanM. S.RossiA.et al. (2006). MODBASE: a database of annotated comparative protein structure models and associated resources. Nucleic Acids Res.34, D291–D295. 10.1093/nar/gkj059
28
ReardonS. (2013). Project ranks billions of drug interactions. Nature503, 449–450. 10.1038/503449a
29
RognanD. (2013). Proteome-scale docking: myth and reality. Drug Discov. Today Technol.10, e403–e409. 10.1016/j.ddtec.2013.01.003
30
RuedaM.BottegoniG.AbagyanR. (2010). Recipes for the selection of experimental protein conformations for virtual screening. J. Chem. Inf. Model.50, 186–193. 10.1021/ci9003943
31
SharmanJ. L.MpamhangaC. P.SpeddingM.GermainP.StaelsB.DacquetC.et al. (2011). IUPHAR-DB: new receptors and tools for easy searching and visualization of pharmacological data. Nucleic Acids Res.39, D534–D538. 10.1093/nar/gkq1062
32
SuA. I.WiltshireT.BatalovS.LappH.ChingK. A.BlockD.et al. (2004). A gene atlas of the mouse and human protein-encoding transcriptomes. Proc. Natl. Acad. Sci. U.S.A.101, 6062–6067. 10.1073/pnas.0400782101
33
TrottO.OlsonA. J. (2010). AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem.31, 455–461. 10.1002/jcc.21334
34
VollenweiderF. X.GeyerM. A. (2001). A systems model of altered consciousness: integrating natural and drug-induced psychoses. Brain Res. Bull.56, 495–507. 10.1016/S0361-9230(01)00646-3
35
WangY.XiaoJ.SuzekT. O.ZhangJ.WangJ.BryantS. H. (2009). PubChem: a public information system for analyzing bioactivities of small molecules. Nucleic Acids Res.37, W623–W633. 10.1093/nar/gkp456
36
WuC.MacleodI.SuA. I. (2013). BioGPS and MyGene.info: organizing online, gene-centric information. Nucleic Acids Res.41, D561–D565. 10.1093/nar/gks1114
37
WuC.OrozcoC.BoyerJ.LegliseM.GoodaleJ.BatalovS.et al. (2009). BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources. Genome Biol.10:R130. 10.1186/gb-2009-10-11-r130
38
XieL.KinningsS. L.BourneP. E. (2012). Novel computational approaches to polypharmacology as a means to define responses to individual drugs. Annu. Rev. Pharmacol. Toxicol.52, 361–379. 10.1146/annurev-pharmtox-010611-134630
39
YangL.LuoH.ChenJ.XingQ.HeL. (2009). SePreSA: a server for the prediction of populations susceptible to serious adverse drug reactions implementing the methodology of a chemical-protein interactome. Nucleic Acids Res.37, W406–W412. 10.1093/nar/gkp312
40
YangL.WangK. J.WangL. S.JeggaA. G.QinS. Y.HeG.et al. (2011). Chemical-protein interactome and its application in off-target identification. Interdiscip. Sci.3, 22–30. 10.1007/s12539-011-0051-8

Summary

Keywords

polypharmacology, molecular docking simulation, gene expression, mechanism of drug action, drug target

Citation

Shmelkov E, Grigoryan A, Swetnam J, Xin J, Tivon D, Shmelkov SV and Cardozo T (2015) Historeceptomic Fingerprints for Drug-Like Compounds. Front. Physiol. 6:371. doi: 10.3389/fphys.2015.00371

Received

18 August 2015

Accepted

20 November 2015

Published

18 December 2015

Volume

6 - 2015

Edited by

Pierre De Meyts, De Meyts R&D Consulting, Belgium

Reviewed by

Steven G. Gray, St. James Hospital/Trinity College Dublin, Ireland; Irina Kufareva, University of California, San Diego, USA

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Timothy Cardozo timothy.cardozo@nyumc.org

This article was submitted to Systems Biology, a section of the journal Frontiers in Physiology

†These authors have contributed equally to this work.

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Systems Biology Archive

ORIGINAL RESEARCH article

Historeceptomic Fingerprints for Drug-Like Compounds

Abstract

Introduction