PeptideManager: a peptide selection tool for targeted proteomic studies involving mixed samples from different species

Demeure, Kevin; Duriez, Elodie; Domon, Bruno; Niclou, Simone P.

doi:10.3389/fgene.2014.00305

METHODS article

Front. Genet., 02 September 2014

Sec. Neurogenomics

Volume 5 - 2014 | https://doi.org/10.3389/fgene.2014.00305

PeptideManager: a peptide selection tool for targeted proteomic studies involving mixed samples from different species

$\r\nKevin Demeure*$ Kevin Demeure¹^*

Elodie Duriez²

Bruno Domon²

Simone P. Niclou¹

¹NorLux Neuro-Oncology Laboratory, Department of Oncology, Centre de Recherche Public de la Santé, Luxembourg, Luxembourg
²LCP, Luxembourg Clinical Proteomics Center, Centre de Recherche Public de la Santé, Strassen, Luxembourg

The search for clinically useful protein biomarkers using advanced mass spectrometry approaches represents a major focus in cancer research. However, the direct analysis of human samples may be challenging due to limited availability, the absence of appropriate control samples, or the large background variability observed in patient material. As an alternative approach, human tumors orthotopically implanted into a different species (xenografts) are clinically relevant models that have proven their utility in pre-clinical research. Patient derived xenografts for glioblastoma have been extensively characterized in our laboratory and have been shown to retain the characteristics of the parental tumor at the phenotypic and genetic level. Such models were also found to adequately mimic the behavior and treatment response of human tumors. The reproducibility of such xenograft models, the possibility to identify their host background and perform tumor-host interaction studies, are major advantages over the direct analysis of human samples. At the proteome level, the analysis of xenograft samples is challenged by the presence of proteins from two different species which, depending on tumor size, type or location, often appear at variable ratios. Any proteomics approach aimed at quantifying proteins within such samples must consider the identification of species specific peptides in order to avoid biases introduced by the host proteome. Here, we present an in-house methodology and tool developed to select peptides used as surrogates for protein candidates from a defined proteome (e.g., human) in a host proteome background (e.g., mouse, rat) suited for a mass spectrometry analysis. The tools presented here are applicable to any species specific proteome, provided a protein database is available. By linking the information from both proteomes, PeptideManager significantly facilitates and expedites the selection of peptides used as surrogates to analyze proteins of interest.

Introduction

Mass spectrometry(MS)-based proteomics provides various approaches (i.e., shotgun, supervised and targeted approaches) (Domon and Aebersold, 2010) in the field of cancer research (Smith, 2012; Deracinois et al., 2013; Marx, 2013) and is nowadays widely used in pre-clinical and clinical investigations (Lee et al., 2011), and for biomarker studies (Li et al., 2011; Meng and Veenstra, 2011; Pan et al., 2012; Waldemarson et al., 2012).

Shotgun proteomics approach is the pipeline followed when biomarker discovery is considered; i.e., protein identification (Eng et al., 1994; Nesvizhskii, 2007) and label-free relative quantification (Bantscheff et al., 2007; Asara et al., 2008; Neilson et al., 2011). Regarding the evaluation (and validation) of biomarker candidates, targeted proteomics is considered for the precise and even the absolute quantification of these candidates (Gallien et al., 2011; Whiteaker et al., 2011; Gillette and Carr, 2013; Marx, 2013). Targeted proteomics is also increasingly used to perform supervised discovery of selected biomarker candidates (Gillette and Carr, 2013; Kim et al., 2013; Marx, 2013; Percy et al., 2014).

In cancer research, the direct analysis of human samples are often hampered because of strong inter-patient variability and limited sample availability. The latter is particularly true for samples requiring invasive sample collection procedures (e.g., biopsies) and is even worse for samples from healthy donors (control samples) (Pesch et al., 2014). Moreover, patient samples are normally limited to one time point and they do not offer the opportunity for a controlled interventional study. To circumvent those restrictions, animal models consisting in the orthotopical implantation of human tumors into animals (xenografts) have proven their utility as relevant models in many studies (Whiteaker et al., 2007; Huszthy et al., 2012; Tang et al., 2012; Klink et al., 2013). In the search for more biomarkers and more effective treatment options against brain tumors, our lab has developed patient derived xenografts for human glioblastoma in immunodeficient mice and rats (Wang et al., 2009). Those animal models have been extensively characterized and shown to retain the characteristics of the parental tumor at the phenotypic and genetic level (Niclou et al., 2008; Rajcevic et al., 2009; Golebiewska et al., 2013). Such models were also found to adequately mimic the behavior and treatment response of human tumors (Keunen et al., 2011). Another major advantage of these models over human samples is their reproducibility and their identifiable host background allowing a detailed analysis of tumor-host interactions. The use of xenograft samples also gives an easy and direct access to control samples exhibiting a more controlled experimental setup compared to human samples.

In bottom-up MS-based proteomics, peptides are generally used as the representative of the proteins either for identification or quantification purposes (Aebersold and Mann, 2003; Chait, 2006). Peptides are identified and monitored via their mass-to-charge ratio and their fragmentation pattern produced under collision induced dissociation (CID) (peptide ion-types, mass-to-charges and relative intensities) (Eng et al., 1994; Nesvizhskii, 2007). Therefore, it is desirable to target a few representative peptides for each protein of interest. The choice of these peptides used as protein surrogates is therefore crucial to unequivocally identify and quantify the protein of interest. A peptide is considered as proteotypic (Kuster et al., 2005; Mallick et al., 2007) when it fulfills two selection criteria: it is a unique representative of a single protein within the proteome of interest and it possesses a good MS detectability (Brownridge and Beynon, 2011). Such peptides allow umambiguous analysis of the targeted protein. However, the selection of proteotypic peptides is often tedious and time-consuming. It has to carefully take into account several essential criteria (Gallien et al., 2011) including uniqueness of peptide sequence, protein digestion efficiency, enzymatical and chemical modifications at the protein or peptide level, physicochemical behavior of the peptide.

In the context of xenograft samples, the use of MS-based targeted (or supervised) proteomics approaches for protein biomarker studies needs to consider the presence of various proteomes from different species. This leads to more complex samples and a more constrained and restricted choice of the surrogate peptides (Figure 1). The tool PeptideManager was built with the purpose of collecting and combining information from different public protein databases [UniProt (SwissProt or TrEMBL or both (UniProt)), RefSeq or IPI] and/or species to facilitate and speed up the surrogate peptide selection compared to the manual selection process. The proposed software is specially designed for the selection of surrogate peptides in the cases involving various species proteomes such as in xenografts. Obviously, it can also be used in the general case when a single proteome is implicated.

FIGURE 1

Figure 1. Schematic of the protein extraction procedure from a human GBM xenografted in a GFP-expressing immunodeficient mouse. Since the excision of the human tumor tissue (in blue) includes a variable proportion of mouse cells (in green), the surrogate peptide selection process of a given protein A must exclusively consider the human specific peptide (blue peptides). Monitoring protein A via human specific peptides in different tumor pieces will eliminate the bias induced by the presence of mouse proteins within the samples.

General Consideration on Peptide Selection

The separation, detection and characterization by liquid chromatography (LC)-MS/MS of a proteotypic peptide of a given protein allows for confident identification given that the peptide identification is reliable (via the m/z of its precursor and of its fragments). However, for quantitative purposes, the choice of a surrogate peptide for a given protein is not trivial since the sequence uniqueness of the peptide is not the only pre-requisite (Gallien et al., 2011). Indeed, several parameters need to be considered to warrant the selection of peptides that reliably represent the targeted protein at the quantitative level (Lange et al., 2008; Gallien et al., 2011). Such peptides are termed proteotypic (Kuster et al., 2005; Mallick et al., 2007) and are characterized by two major conditions; uniqueness of its sequence within the proteome of interest and a good detectability in LC-MS(/MS). The good detectability of peptides in MS is influenced by its physicochemical properties and other experimental-related parameters, which are described hereafter.

Ideally, the enzymatic digestion of the proteins should not introduce any bias. However, since the yield of an enzymatic reaction is rarely complete, the enzymatic digestion itself introduces some variability across samples (Brownridge and Beynon, 2011; Loziuk et al., 2013). This can be limited in two ways. Firstly, a precise control of the experimental conditions will limit the variability across sample replicates. Secondly, since the digestion efficiency is not identical for all the peptides derived from the same protein, a rational choice of the peptides exhibiting the best cleavage propensities should be performed. The yield of the proteolytic cleavage depends on the protease and on the amino acid composition around the cleavage sites. The cleavage efficiencies can be estimated theoretically for some proteases and various web-tools are freely available to predict which proteolytic peptide bonds are likely to be missed by the standard enzymes [e.g., PeptideCutter web-tool on ExPASy (http://web.expasy.org/peptide_cutter/) (Eyers et al., 2011; Artimo et al., 2012; Lawless and Hubbard, 2012)].

The reduction and the alkylation steps of the cysteines in proteins are intended to make cleavage sites accessible that would otherwise be hindered. Nevertheless, as any chemical reaction, this sample preparation procedure can introduce unpredictable biases because of an incomplete reaction. Therefore, peptides containing amino acids that are prone to chemical modification [e.g., cysteines (incomplete reduction/alkylation) and methionine (partially oxidized)] should be avoided (Bischoff and Schluter, 2012). Correspondingly, amino acids with the potential for enzymatic post-translational modifications (PTMs) should be avoided if the analysis of those modifications is not in the scope of the study. Peptides containing sequence uncertainties (e.g., sequence conflicts between public protein databases) or amino acids showing variability due to single-nucleotide polymorphisms (SNPs) should be discarded as well.

Depending on whether all isoforms or a particular isoform of a given protein are of interest, either the selected peptides should be representative of all the isoforms of the protein or the selected peptides should be representative of only one specific protein isoform.

The mass-to-charge ratio of the selected peptide should comply with the mass range restriction of the mass spectrometer(s) that will be used for the subsequent analyses (e.g., m/z between 400 and 1600 on a triple quadrupole platform). Moreover, very short peptides are less likely to have a unique sequence and they are more susceptible to interferences. On the other hand, lengthy peptides are not desirable due to their hydrophobicity and to issues regarding their synthesis and purification. Typically, peptide ranging from 5 to around 22 amino acids are best suited for quantification purpose (Gallien et al., 2011).

Various tools are currently available for the theoretical or semi-empirical estimation of the proteotypic properties of a peptide sequence including the digestion efficacy according to the cleavage site (Kuster et al., 2005; Mallick et al., 2007; Fusaro et al., 2009; Eyers et al., 2011; Lawless and Hubbard, 2012; Mohammed et al., 2014; Qeli et al., 2014). However, these tools do not handle samples containing proteins from multiple species that therefore require a very time-consuming manual selection of surrogate peptide candidates. The proposed tool was therefore intended to automate this selection step of unique peptide candidates by combining all relevant information present in public protein databases to help the filtering of inappropriate peptides (e.g., peptide sequence present in both proteomes).

The tool PeptideManager was designed to: (1) build and store peptide databases from public repositories, (2) link available information (e.g., PTM sites, SNPs, signal peptide) from public databases at the peptide level, (3) allow queries and peptide pre-selection within those databases, and most importantly (4) perform peptide pre-selection within a given proteome of interest while taking into account the presence of another species proteome within the sample (Figure 2).

FIGURE 2

Figure 2. Scheme illustrating how PeptideManager is operating. Required information from the public database is extracted to produce the peptide database (SQLite format) (A). Created peptide databases are stored and can be used for peptide selection queries (B,C). The peptide selection can be done with (B) or without (C) the presence of a background proteome. It is noteworthy that the presence of a background proteome (as in the case of xenografts) (B) will generally reduce the number of surrogate peptide candidates compared to the list of peptides obtained in (C).

Materials and Methods

Glioblastoma samples were collected at the Neurosurgery Department of the Center Hospitalier in Luxembourg (CHL) from patients having given their informed consent. Collection and use of patient tumor material has been approved by the National Ethics Committee for Research (CNER) of Luxembourg. All animal procedures were approved by the national authorities responsible for animal experiments in Luxembourg.

Peptide Database Creation

The peptide selection tool PeptideManager was written in C# language and the GUI (graphical user interface) was designed with Visual Studio Express 2010. The databases were built in the SQLite format (http://www.sqlite.org/about.html). PeptideManager can be run on any computer on which the Microsoft.NET framework is installed. PeptideManager and its user guide are freely available at http://peptidemanager-lrno.sourceforge.net.

The peptide databases (in SQLite format) derived from different public protein databases can be built by PeptideManager, namely UniProt (Uniprot, 2009, 2014; Magrane and Consortium, 2011), RefSeq (Pruitt et al., 2007, 2012, 2014) and IPI (Kersey et al., 2004). The different file formats requested to build a peptide database are indicated in Table 1. Protein sequences are digested in silico by the proteolytic enzyme selected (trypsin, Lys-C, Arg-C or Lys-N). The required information present in the public database is extracted and inserted into the PeptideManager database. If present in the public protein data repository, information about PTMs, sequence conflicts or other sequence related modifications are extracted and inserted at the peptide level in the newly generated database. In order to limit the size of the created peptide database, a minimal peptide length (3–5 amino acids) as well as the number of allowed miscleavages (0–2) can be selected. Once created, the database is indexed in PeptideManager and is immediately available for search requests.

TABLE 1

Table 1. List of the different file formats supported by PeptideManager for the different public protein database repositories.

Contrary to similar tools like PeptidePicker (Mohammed et al., 2014), CONSeQuence (Eyers et al., 2011) or even Skyline (Maclean et al., 2010), PeptideManager is solely intended to build a peptide database, to allow search queries within these databases and to help the user to perform unique peptide selection for targeted experiments. The advantages of PeptideManager are to allow the user to define a host/background proteome, to import databases from UniProt, RefSeq and IPI, to extract all the available information (protein features) and to suggest default filters for the selection of unique peptide sequences (with or without the presence of a host/background proteome).

Surrogate Peptide Selection

As shown in Figure 3, search queries can be performed with protein accession numbers (individually or in batch mode), protein name or peptide sequences (individually or in batch mode). The displayed results can be filtered against unsuitable amino acids or lengths for example. The results can be directly saved in csv file format. When only one species proteome is studied, this type of search request provides all the information needed to select surrogate peptide candidates (unique peptide sequences) for the given protein. Moreover, this type of request can be used to define the uniqueness of a list of detected peptides from experimental evidences and confirm the identification of the related proteins.

FIGURE 3

Figure 3. Print screen of the results displayed by PeptideManager for the P13645 human protein. Search queries can be performed by protein ID (A), by protein name (B) or by peptide sequence(s) (C). The list of peptides obtained can be filtered out according to the length (e.g., 5 a.a. ≤ peptide length ≤ 22 a.a.), the presence of unwanted amino acid (methionine-containing peptide for example) or the frequency of the peptide sequence within the database (D). The results can be exported in csv file format (E).

The major advantage of PeptideManager is to possess a search function specifically dedicated to peptide selection when various species proteomes are involved (Figure 1). In addition to the previous search request, the proteome of interest and the proteome of the host species (background proteome) can be selected (Figure 4). In this case, the information will be gathered by PeptideManager within the peptide databases of both proteomes. The protein accession number is used to extract all peptide information related to the targeted protein in the database of the proteome of interest. Subsequently, the peptide sequences are searched against the peptide database of the host species proteome to determine the number of occurrences (hits) within the background proteome. In order to monitor/quantify a protein of interest without any bias coming from the host/background proteome, only peptides that are unique sequences (i.e., unique representative of the protein of interest) within the proteome of interest and absent within the host/background proteome should be selected. An example of a result file (csv format) is partly displayed in Figure 4.

FIGURE 4

Figure 4. Print screen showing the results obtained with PeptideManager for the human P13645 protein within the mouse proteome (host proteome). The information concerning both proteomes [selected proteomes in (A,B)] is brought together and the number of observation (hits) of the peptide sequence is indicated in each proteome. Human specific peptide candidates are those with one hit in the human proteome [selected in (A)] and no hit in the mouse proteome [selected in (B)] (e.g., HGNSHQGEPR). The results (filtered or not) can be saved in a csv file.

As mentioned, the software is not only intended for the selection of unique sequences of a given protein but also to perform any kind of search queries within the databases. In order to keep all the information available for the user, filters are available and can be used/or not according to the purpose of the query. However, in the case of the selection of unique peptide sequences (with or without the presence of a host proteome), it is possible to apply “advised default values” of the filters. A comprehensive user guide is provided as Supplementary Material. This user guide explains the different steps to use correctly the software and includes several case studies in order to exemplify different situations that users could meet, e.g., protein isoform differentiation, post-translational modification monitoring, unique peptide selection with or without the presence of a host/background proteome.

PeptideManager was used for the selection of surrogate peptides to develop SRM (selected reaction monitoring) assays on a triple-quadrupole platform for the targeted analysis of protein biomarker candidates. This was done in the context of human GBM xenografts (mice and rats) and PeptideManager greatly facilitated and expedited the selection of unique peptide from mixed samples involving different species proteomes. Compared to the few proteins per day for which unique peptide selection can be done manually, PeptideManager succeeded in performing the selection of peptide candidates for hundreds of proteins a day.

In order to further validate the selected peptides and/or decrease further the number of peptide candidates, additional information, not present in the public protein databases, such as the enzymatic digestion efficiencies and the LC-MS(/MS) behaviors of the peptide candidates can be used. From the selected peptides, predictive computational tools (Artimo et al., 2012; Lawless and Hubbard, 2012) can be used to filter out peptides arising from poorly efficient cleavage sites that would lead to an erroneous values of peptide/protein amount as well as a decrease in sensitivity. MS-based data from the proteomic community are freely available in public data repositories such as PeptideAtlas (Desiere et al., 2006), GPM Proteomics Database (Craig et al., 2004, 2005) or PRIDE (Vizcaino et al., 2013) that permit to estimate the MS-behaviors of peptide sequence candidates.

In these public databases, proteomics data can be retrieved by protein accession number, peptide sequence, species or even sample type (e.g., brain, kidney, liver). For a given protein, all the previously detected peptides identifying this protein are displayed; however they may not be unique and may contain miscleavages. Using the list of pre-selected unique peptide candidates from PeptideManager will allow to rapidly access the pertinent information from the data provided by those public repositories. Additional empirical and theoretical information (available in PeptideAtlas) may help to select the best candidates. For example, how many times a peptide sequence was detected within a given set of experiments can help to estimate the detectability of this peptide by LC-MS (due to digestion efficacy, good ionization, LC or MS behavior, etc.). Theoretical scores based on various parameters (e.g., amino acid composition and hydrophobicity) and algorithms can give theoretical estimations of the LC-MS behavior and detectability of the peptides (Mallick et al., 2007; Fusaro et al., 2009; Eyers et al., 2011; Qeli et al., 2014). All this information can be used to further rationalize the peptide selection and keep those peptides predicted to demonstrate the best LC-MS behavior and leading to the MS measurements with the highest possible sensitivity. If MS/MS spectra are available, they can be used to select the most appropriate charge state of the peptide to be monitored. In the context of the development of an SRM assay, the fragmentation spectra can be used to select the most intense fragments for a given peptide which should lead to an optimal sensitivity for that peptide when monitoring those transitions by SRM. Reference transitions for a large number of peptides are publicly available in the SRMAtlas repository (Picotti et al., 2008, 2013; Huttenhain et al., 2013). However, those transitions need to be validated (e.g., absence of interferences) within the biological samples of interest.

Conclusions and Perspectives

PeptideManager was used for the selection of unique peptide candidates as protein surrogates in the context of a supervised evaluation (SRM assays on a triple-quadrupole platform) of protein biomarker candidates in human GBM xenografts (mice and rats). This tool greatly facilitated and expedited the peptide selection for these mixed samples involving different species proteomes. Moreover, it is applicable to complex samples (with or without the presence of a host (background) proteome) of all species for which a protein database is available.

PeptideManager could be extended to include additional information helping to the peptide selection by directly inserting the information from MS-based proteomics data public repositories (e.g., PeptideAtlas) and the theoretical information about the enzymatic cleavage efficiency (e.g., PeptideCutter) within the peptide databases.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This work was supported by a grant from FNR (Fonds National de la Recherche) of Luxembourg to Simone P. Niclou (CORE Project ESCAPE). The authors would like to thank Anna Golebiewska, Sébastien Bougnaud and Anaïs Oudin for the images of the mouse brain section.

Supplementary Material

The Supplementary Material for this article can be found online at: http://www.frontiersin.org/journal/10.3389/fgene.2014.00305/abstract

References

Aebersold, R., and Mann, M. (2003). Mass spectrometry-based proteomics. Nature 422, 198–207. doi: 10.1038/nature01511

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Artimo, P., Jonnalagedda, M., Arnold, K., Baratin, D., Csardi, G., De Castro, E., et al. (2012). ExPASy: SIB bioinformatics resource portal. Nucleic Acids Res. 40, W597–W603. doi: 10.1093/nar/gks400

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Asara, J. M., Christofk, H. R., Freimark, L. M., and Cantley, L. C. (2008). A label-free quantification method by MS/MS TIC compared to SILAC and spectral counting in a proteomics screen. Proteomics 8, 994–999. doi: 10.1002/pmic.200700426

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Bantscheff, M., Schirle, M., Sweetman, G., Rick, J., and Kuster, B. (2007). Quantitative mass spectrometry in proteomics: a critical review. Anal. Bioanal. Chem. 389, 1017–1031. doi: 10.1007/s00216-007-1486-6

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Bischoff, R., and Schluter, H. (2012). Amino acids: chemistry, functionality and selected non-enzymatic post-translational modifications. J. Proteomics 75, 2275–2296. doi: 10.1016/j.jprot.2012.01.041

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Brownridge, P., and Beynon, R. J. (2011). The importance of the digest: proteolysis and absolute quantification in proteomics. Methods 54, 351–360. doi: 10.1016/j.ymeth.2011.05.005

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Chait, B. T. (2006). Chemistry. Mass spectrometry: bottom-up or top-down? Science 314, 65–66. doi: 10.1126/science.1133987

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Craig, R., Cortens, J. P., and Beavis, R. C. (2004). Open source system for analyzing, validating, and storing protein identification data. J. Proteome Res. 3, 1234–1242. doi: 10.1021/pr049882h

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Craig, R., Cortens, J. P., and Beavis, R. C. (2005). The use of proteotypic peptide libraries for protein identification. Rapid Commun. Mass Spectrom. 19, 1844–1850. doi: 10.1002/rcm.1992

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Deracinois, B., Flahaut, C., Duban-Deweer, S., and Karamanos, Y. (2013). Comparative and quantitative global proteomics approaches: an overview. Proteomes 1, 180–218. doi: 10.3390/proteomes1030180

CrossRef Full Text

Desiere, F., Deutsch, E. W., King, N. L., Nesvizhskii, A. I., Mallick, P., Eng, J., et al. (2006). The peptideatlas project. Nucleic Acids Res. 34, D655–D658. doi: 10.1093/nar/gkj040

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Domon, B., and Aebersold, R. (2010). Options and considerations when selecting a quantitative proteomics strategy. Nat. Biotechnol. 28, 710–721. doi: 10.1038/nbt.1661

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Eng, J. K., McCormack, A. L., and Yates, J. R. (1994). An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom. 5, 976–989. doi: 10.1016/1044-0305(94)80016-2

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Eyers, C. E., Lawless, C., Wedge, D. C., Lau, K. W., Gaskell, S. J., and Hubbard, S. J. (2011). CONSeQuence: prediction of reference peptides for absolute quantitative proteomics using consensus machine learning approaches. Mol. Cell. Proteomics 10:M110.003384. doi: 10.1074/mcp.M110.003384

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Fusaro, V. A., Mani, D. R., Mesirov, J. P., and Carr, S. A. (2009). Prediction of high-responding peptides for targeted protein assays by mass spectrometry. Nat. Biotechnol. 27, 190–198. doi: 10.1038/nbt.1524

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gallien, S., Duriez, E., and Domon, B. (2011). Selected reaction monitoring applied to proteomics. J. Mass Spectrom. 46, 298–312. doi: 10.1002/jms.1895

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gillette, M. A., and Carr, S. A. (2013). Quantitative analysis of peptides and proteins in biomedicine by targeted mass spectrometry. Nat. Methods 10, 28–34. doi: 10.1038/nmeth.2309

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Golebiewska, A., Bougnaud, S., Stieber, D., Brons, N. H., Vallar, L., Hertel, F., et al. (2013). Side population in human glioblastoma is non-tumorigenic and characterizes brain endothelial cells. Brain 136, 1462–1475. doi: 10.1093/brain/awt025

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Huszthy, P. C., Daphu, I., Niclou, S. P., Stieber, D., Nigro, J. M., Sakariassen, P. O., et al. (2012). In vivo models of primary brain tumors: pitfalls and perspectives. Neuro. Oncol. 14, 979–993. doi: 10.1093/neuonc/nos135

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Huttenhain, R., Surinova, S., Ossola, R., Sun, Z., Campbell, D., Cerciello, F., et al. (2013). N-glycoprotein SRMAtlas: a resource of mass spectrometric assays for N-glycosites enabling consistent and multiplexed protein quantification for clinical applications. Mol. Cell. Proteomics 12, 1005–1016. doi: 10.1074/mcp.O112.026617

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kersey, P. J., Duarte, J., Williams, A., Karavidopoulou, Y., Birney, E., and Apweiler, R. (2004). The international protein index: an integrated database for proteomics experiments. Proteomics 4, 1985–1988. doi: 10.1002/pmic.200300721

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Keunen, O., Johansson, M., Oudin, A., Sanzey, M., Rahim, S. A., Fack, F., et al. (2011). Anti-VEGF treatment reduces blood supply and increases tumor cell invasion in glioblastoma. Proc. Natl. Acad. Sci. U.S.A. 108, 3749–3754. doi: 10.1073/pnas.1014480108

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kim, Y. J., Gallien, S., Van Oostrum, J., and Domon, B. (2013). Targeted proteomics strategy applied to biomarker evaluation. Proteomics Clin. Appl. 7, 739–747. doi: 10.1002/prca.201300070

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Klink, B., Miletic, H., Stieber, D., Huszthy, P. C., Valenzuela, J. A., Balss, J., et al. (2013). A novel, diffusely infiltrative xenograft model of human anaplastic oligodendroglioma with mutations in FUBP1, CIC, and IDH1. PLoS ONE 8:e59773. doi: 10.1371/journal.pone.0059773

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kuster, B., Schirle, M., Mallick, P., and Aebersold, R. (2005). Scoring proteomes with proteotypic peptide probes. Nat. Rev. Mol. Cell Biol. 6, 577–583. doi: 10.1038/nrm1683

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Lange, V., Picotti, P., Domon, B., and Aebersold, R. (2008). Selected reaction monitoring for quantitative proteomics: a tutorial. Mol. Syst. Biol. 4:222. doi: 10.1038/msb.2008.61

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Lawless, C., and Hubbard, S. J. (2012). Prediction of missed proteolytic cleavages for the selection of surrogate peptides for quantitative proteomics. OMICS 16, 449–456. doi: 10.1089/omi.2011.0156

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Lee, J. M., Han, J. J., Altwerger, G., and Kohn, E. C. (2011). Proteomics and biomarkers in clinical trials for drug development. J. Proteomics 74, 2632–2641. doi: 10.1016/j.jprot.2011.04.023

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Li, J., Kelm, K. B., and Tezak, Z. (2011). Regulatory perspective on translating proteomic biomarkers to clinical diagnostics. J. Proteomics 74, 2682–2690. doi: 10.1016/j.jprot.2011.07.028

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Loziuk, P. L., Wang, J., Li, Q., Sederoff, R. R., Chiang, V. L., and Muddiman, D. C. (2013). Understanding the role of proteolytic digestion on discovery and targeted proteomic measurements using liquid chromatography tandem mass spectrometry and design of experiments. J. Proteome Res. 12, 5820–5829. doi: 10.1021/pr4008442

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Maclean, B., Tomazela, D. M., Shulman, N., Chambers, M., Finney, G. L., Frewen, B., et al. (2010). Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics 26, 966–968. doi: 10.1093/bioinformatics/btq054

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Magrane, M., and Consortium, U. (2011). UniProt Knowledgebase: a hub of integrated protein data. Database (Oxford) 2011:bar009. doi: 10.1093/database/bar009

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Mallick, P., Schirle, M., Chen, S. S., Flory, M. R., Lee, H., Martin, D., et al. (2007). Computational prediction of proteotypic peptides for quantitative proteomics. Nat. Biotechnol. 25, 125–131. doi: 10.1038/nbt1275

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Marx, V. (2013). Targeted proteomics. Nat. Methods 10, 19–22. doi: 10.1038/nmeth.2285

CrossRef Full Text

Meng, Z., and Veenstra, T. D. (2011). Targeted mass spectrometry approaches for protein biomarker verification. J. Proteomics 74, 2650–2659. doi: 10.1016/j.jprot.2011.04.011

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Mohammed, Y., Domanski, D., Jackson, A. M., Smith, D. S., Deelder, A. M., Palmblad, M., et al. (2014). PeptidePicker: a scientific workflow with web interface for selecting appropriate peptides for targeted proteomics experiments. J. Proteomics 106C, 151–161. doi: 10.1016/j.jprot.2014.04.018

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Neilson, K. A., Ali, N. A., Muralidharan, S., Mirzaei, M., Mariani, M., Assadourian, G., et al. (2011). Less label, more free: approaches in label-free quantitative mass spectrometry. Proteomics 11, 535–553. doi: 10.1002/pmic.201000553

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Nesvizhskii, A. I. (2007). Protein identification by tandem mass spectrometry and sequence database searching. Methods Mol. Biol. 367, 87–119. doi: 10.1385/1-59745-275-0:87

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Niclou, S. P., Danzeisen, C., Eikesdal, H. P., Wiig, H., Brons, N. H., Poli, A. M., et al. (2008). A novel eGFP-expressing immunodeficient mouse model to study tumor-host interactions. FASEB J. 22, 3120–3128. doi: 10.1096/fj.08-109611

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Pan, S., Chen, R., Brand, R. E., Hawley, S., Tamura, Y., Gafken, P. R., et al. (2012). Multiplex targeted proteomic assay for biomarker detection in plasma: a pancreatic cancer biomarker case study. J. Proteome Res. 11, 1937–1948. doi: 10.1021/pr201117w

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Percy, A. J., Chambers, A. G., Yang, J., Hardie, D. B., and Borchers, C. H. (2014). Advances in multiplexed MRM-based protein biomarker quantitation toward clinical utility. Biochim. Biophys. Acta 1844, 917–926. doi: 10.1016/j.bbapap.2013.06.008

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Pesch, B., Bruning, T., Johnen, G., Casjens, S., Bonberg, N., Taeger, D., et al. (2014). Biomarker research with prospective study designs for the early detection of cancer. Biochim. Biophys. Acta 1844, 874–883. doi: 10.1016/j.bbapap.2013.12.007

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Picotti, P., Clement-Ziza, M., Lam, H., Campbell, D. S., Schmidt, A., Deutsch, E. W., et al. (2013). A complete mass-spectrometric map of the yeast proteome applied to quantitative trait analysis. Nature 494, 266–270. doi: 10.1038/nature11835

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Picotti, P., Lam, H., Campbell, D., Deutsch, E. W., Mirzaei, H., Ranish, J., et al. (2008). A database of mass spectrometric assays for the yeast proteome. Nat. Methods 5, 913–914. doi: 10.1038/nmeth1108-913

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Pruitt, K. D., Brown, G. R., Hiatt, S. M., Thibaud-Nissen, F., Astashyn, A., Ermolaeva, O., et al. (2014). RefSeq: an update on mammalian reference sequences. Nucleic Acids Res. 42, D756–D763. doi: 10.1093/nar/gkt1114

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Pruitt, K. D., Tatusova, T., Brown, G. R., and Maglott, D. R. (2012). NCBI reference sequences (RefSeq): current status, new features and genome annotation policy. Nucleic Acids Res. 40, D130–D135. doi: 10.1093/nar/gkr1079

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Pruitt, K. D., Tatusova, T., and Maglott, D. R. (2007). NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 35, D61–D65. doi: 10.1093/nar/gkl842

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Qeli, E., Omasits, U., Goetze, S., Stekhoven, D. J., Frey, J. E., Basler, K., et al. (2014). Improved prediction of peptide detectability for targeted proteomics using a rank-based algorithm and organism-specific data. J. Proteomics 108, 269–283. doi: 10.1016/j.jprot.2014.05.011

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Rajcevic, U., Petersen, K., Knol, J. C., Loos, M., Bougnaud, S., Klychnikov, O., et al. (2009). iTRAQ-based proteomics profiling reveals increased metabolic activity and cellular cross-talk in angiogenic compared with invasive glioblastoma phenotype. Mol. Cell. Proteomics 8, 2595–2612. doi: 10.1074/mcp.M900124-MCP200

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Smith, R. D. (2012). Mass spectrometry in biomarker applications: from untargeted discovery to targeted verification, and implications for platform convergence and clinical application. Clin. Chem. 58, 528–530. doi: 10.1373/clinchem.2011.180596

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Tang, H. Y., Beer, L. A., Chang-Wong, T., Hammond, R., Gimotty, P., Coukos, G., et al. (2012). A xenograft mouse model coupled with in-depth plasma proteome analysis facilitates identification of novel serum biomarkers for human ovarian cancer. J. Proteome Res. 11, 678–691. doi: 10.1021/pr200603h

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Uniprot, C. (2009). The universal protein resource (UniProt) 2009. Nucleic Acids Res. 37, D169–D174. doi: 10.1093/nar/gkn664

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Uniprot, C. (2014). Activities at the universal protein resource (UniProt). Nucleic Acids Res. 42, D191–D198. doi: 10.1093/nar/gkt1140

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Vizcaino, J. A., Cote, R. G., Csordas, A., Dianes, J. A., Fabregat, A., Foster, J. M., et al. (2013). The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 41, D1063–D1069. doi: 10.1093/nar/gks1262

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Waldemarson, S., Krogh, M., Alaiya, A., Kirik, U., Schedvins, K., Auer, G., et al. (2012). Protein expression changes in ovarian cancer during the transition from benign to malignant. J. Proteome Res. 11, 2876–2889. doi: 10.1021/pr201258q

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wang, J., Miletic, H., Sakariassen, P. O., Huszthy, P. C., Jacobsen, H., Brekka, N., et al. (2009). A reproducible brain tumour model established from human glioblastoma biopsies. BMC Cancer 9:465. doi: 10.1186/1471-2407-9-465

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Whiteaker, J. R., Lin, C., Kennedy, J., Hou, L., Trute, M., Sokal, I., et al. (2011). A targeted proteomics-based pipeline for verification of biomarkers in plasma. Nat. Biotechnol. 29, 625–634. doi: 10.1038/nbt.1900

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Whiteaker, J. R., Zhang, H., Zhao, L., Wang, P., Kelly-Spratt, K. S., Ivey, R. G., et al. (2007). Integrated pipeline for mass spectrometry-based discovery and confirmation of biomarkers demonstrated in a mouse model of breast cancer. J. Proteome Res. 6, 3962–3975. doi: 10.1021/pr070202v

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Keywords: mass spectrometry, targeted proteomics, rodent xenografts, human glioblastoma, mixed samples, automated tool, unique peptide selection

Citation: Demeure K, Duriez E, Domon B and Niclou SP (2014) PeptideManager: a peptide selection tool for targeted proteomic studies involving mixed samples from different species. Front. Genet. 5:305. doi: 10.3389/fgene.2014.00305

Received: 18 June 2014; Accepted: 16 August 2014;
Published online: 02 September 2014.

Edited by:

Yannis Karamanos, Université d'Artois, France

Reviewed by:

Scott E. Hemby, Wake Forest University School of Medicine, USA
Amit Kumar Yadav, Translational Health Science and Technology Institute, India
Gwenael Pottiez, Neuro-Bio Ltd, UK

Copyright © 2014 Demeure, Duriez, Domon and Niclou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Kevin Demeure, NorLux Neuro-Oncology Laboratory, Department of Oncology, Centre de Recherche Public de la Santé, 84 Val Fleuri, L-1526 Luxembourg, Luxembourg e-mail: kevin.demeure@crp-sante.lu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.