Managing the green proteomes for the next decade of plant research

Carroll, Andrew W; Joshi, Hiren J; Heazlewood, Joshua L

doi:10.3389/fpls.2013.00501

EDITORIAL article

Front. Plant Sci., 16 December 2013

Sec. Plant Proteomics and Protein Structural Biology

Volume 4 - 2013 | https://doi.org/10.3389/fpls.2013.00501

Andrew W. Carroll ¹, Hiren J. Joshi ², Joshua L. Heazlewood ¹*

1. Physical Biosciences Division and Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
2. Department of Cellular and Molecular Medicine, Copenhagen Center for Glycomics, University of Copenhagen, Copenhagen, Denmark

Over the past decade the field of proteomics has transitioned from a highly specialized research area into a conventional technique widely employed by plant biologists. The approach now ranges from basic protein identification to advanced comparative studies. The result has been an abundance of proteomics data, often not readily available to the research community (Heazlewood, 2011). This has led to the creation of numerous proteomic resources, which are often referred to as boutique databases. Generally, these sites exist outside the traditional community-driven centralized repositories. While the geographic location of web-based resources is somewhat inconsequential, it can highlight active regions of plant proteomic-based research. The intention of this Research Topic on Plant Proteomic Resources is to collect articles focusing on these resources and provide an overview of current online plant proteomic portals.

Plant proteomic resources are often integrative and comprise collections of diverse ‘omics information to support the proteomic data. A good example of this is the GabiPD portal (Usadel et al., 2012); the website is a gateway for the German plant community to codify and unite various research programs at one site. A more focused resource, pep2pro, was constructed to support large-scale proteomic surveys in the model plant Arabidopsis (Hirsch-Hoffmann et al., 2012). The pep2pro repository employs a unique workflow to match spectral data directly against the Arabidopsis genome. Techniques for arraying proteins by 2-DE have been employed for decades, and the GelMap portal links proteomic-based identifications with these gel electrophoresis maps (Senkler and Braun, 2012). The GelMap resource provides annotated two-dimensional arrays of proteins from a range of sample types.

Applications of proteomics to characterize organelles were among the first large-scale surveys in plants. The AT_CHLORO database represents the most extensive analysis of the chloroplast from the model plant Arabidopsis (Bruley et al., 2012). This resource provides a compendium of proteins identified in the chloroplast and contains information on its sub-compartments, e.g., the thylakoid. Organelle proteome databases such as AT_CHLORO made up many of the early online plant proteomic databases, including those for the mitochondrion (Heazlewood and Millar, 2005) and the peroxisome (Reumann et al., 2004). The latter was recently used to develop a new resource, PredPlantPTS1, which predicts whether a protein will localize to the peroxisome (Reumann et al., 2012). The SUBcellular Arabidopsis database (SUBA) contains data from most subcellular proteomic surveys in Arabidopsis (Tanz et al., 2013). A similarly focused resource, the Plant Protein DataBase (PPDB), also deals with subcellular proteomics but encompasses other plant species as well (Sun et al., 2009). Although the latter two resources are not part of this collection, data housed by these repositories are available through the MASCP Gator, a portal designed to aggregate Arabidopsis proteomic data for the community. The MASCP Gator interface was developed to provide a mechanism for proteomic data visualization from multiple data sources (Mann et al., 2013).

The model plant Arabidopsis dominates the plant proteomic resource landscape, but as genomic information for other plant species becomes available, databases for those species have been established. The rice RNA-binding protein resource provides a curated collection of over 250 experimentally identified RNA-interacting proteins from rice (Doroshenk et al., 2012), together with functional annotation, expression, and phylogenetic relationships. Large-scale developmental and organ-specific analyses of the rice proteome have now also been conducted. These data are available through the rice proteogenomics database (OryzaPG-DB), which provides a visual relationship between the genome and the identified proteome (Helmy et al., 2012). The Soybean Proteome Database (SPD) initially focused on curating proteins that were responsive to flooding (Ohyanagi et al., 2012), but it now includes a host of 2-DE arrayed organelle proteomes, expression information, and information on other stress-induced proteins from this important leguminous crop.

Seed development represents a major agricultural focus for plant researchers and, as such, this developmental process has been extensively targeted by proteomic surveys. The seed proteome web portal provides an extensive collection of data, including quantitative information on proteins involved in seed development (Galland et al., 2012). The Seeds of Chernobyl resource highlights a different aspect of seed development in plants by cataloging the effects of ionizing radiation on seed maturation and development (Klubicova et al., 2012).

Post-translational modifications (PTMs) often represent the functional state of a protein and are a significant objective for many proteomic studies. Phosphorylation is one such PTM, and a number of resources have been developed to interact with phosphoproteomic datasets. Initial phosphoproteomic surveys involved Arabidopsis, and one of the first phosphorylation-based databases created in any species was PhosPhAt (Arsova and Schulze, 2012). The resource contains many thousands of experimentally identified sites available in the literature. The expansion of phosphoproteomic surveys outside Arabidopsis has resulted in the creation of two further resources. The P³DB database houses tens of thousands of phosphopeptides from six plant species. The collection of such an array of data by P³DB led to the development of Musite, a utility that predicts phosphorylation sites in plant proteins (Yao et al., 2012). Lastly, the Medicago PhosphoProtein Database houses data from a recent large-scale phosphoproteomic analysis of this model legume plant (Rose et al., 2012).

The proteomics community has created an array of online tools that can be used to support various technical approaches in mass spectrometry. The MRMaid utility was designed to facilitate the selection of peptides for targeted proteomic analyses (Fan et al., 2012). The tool leverages plant spectral information housed in the PRIDE repository (Vizcaino et al., 2013) to assist in the selection of protein-specific peptides for multiple reaction monitoring (MRM) of plant samples. In a similar vein, the ProMEX resource enables newly collected tandem mass spectrometry data to be queried against previously matched experimental spectra (Wienkoop et al., 2012). This spectral matching process provides real-world context, as tandem mass spectra generally do not produce evenly distributed fragment ions.
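As a rough illustration of what querying a new tandem mass spectrum against a library of previously matched spectra can involve, the sketch below scores spectra with a normalized dot product (cosine similarity) over binned fragment-ion intensities. This is a generic, commonly used scoring approach and is not taken from ProMEX; the function names, the 1.0 m/z bin width, and the peak lists are all hypothetical and chosen only for the example.

```python
import math


def bin_spectrum(peaks, bin_width=1.0):
    """Sum fragment-ion intensities into fixed-width m/z bins."""
    binned = {}
    for mz, intensity in peaks:
        key = int(mz // bin_width)
        binned[key] = binned.get(key, 0.0) + intensity
    return binned


def cosine_similarity(query, reference, bin_width=1.0):
    """Normalized dot product of two binned spectra (0 = no shared bins)."""
    q = bin_spectrum(query, bin_width)
    r = bin_spectrum(reference, bin_width)
    dot = sum(q[k] * r[k] for k in q.keys() & r.keys())
    norm_q = math.sqrt(sum(v * v for v in q.values()))
    norm_r = math.sqrt(sum(v * v for v in r.values()))
    norm = norm_q * norm_r
    return dot / norm if norm else 0.0


def best_match(query, library, bin_width=1.0):
    """Return (name, score) for the highest-scoring spectrum in a non-empty library."""
    scored = (
        (name, cosine_similarity(query, ref, bin_width))
        for name, ref in library.items()
    )
    return max(scored, key=lambda item: item[1])


# Hypothetical reference library: name -> list of (m/z, intensity) peaks.
LIBRARY = {
    "peptide_A": [(175.1, 100.0), (304.2, 20.0), (417.3, 60.0)],
    "peptide_B": [(147.1, 50.0), (276.2, 80.0), (389.3, 10.0)],
}

# Hypothetical newly acquired spectrum with uneven fragment intensities.
QUERY = [(175.1, 90.0), (304.2, 25.0), (417.3, 55.0)]

match_name, match_score = best_match(QUERY, LIBRARY)
print(match_name, round(match_score, 3))
```

Because the score compares observed intensity patterns rather than assuming every theoretical fragment ion is equally likely, a reference spectrum with the same uneven intensity profile as the query scores higher than one that merely shares fragment masses.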

The range of proteome resources highlighted in this Research Topic reflects the diversity of proteomic-based applications in plant sciences. The principal objective of many of these research groups has been to catalogue or collect data in an effort to capture information. Indeed, the creation of many data portals likely reflects an attempt to make sense of one's own data. This collection highlights the diversity and range of plant proteomic resources and utilities available to the plant research community.

Statements

Acknowledgments

This work, conducted by the Joint BioEnergy Institute, was supported by the Office of Science, Office of Biological and Environmental Research, of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.

References

1. Arsova, B., and Schulze, W. X. (2012). Current status of the plant phosphorylation site database PhosPhAt and its use as a resource for molecular plant physiology. Front. Plant Sci. 3:132. doi: 10.3389/fpls.2012.00132
2. Bruley, C., Dupierris, V., Salvi, D., Rolland, N., and Ferro, M. (2012). AT_CHLORO: A chloroplast protein database dedicated to sub-plastidial localization. Front. Plant Sci. 3:205. doi: 10.3389/fpls.2012.00205
3. Doroshenk, K. A., Crofts, A. J., Morris, R. T., Wyrick, J. J., and Okita, T. W. (2012). RiceRBP: a Resource for Experimentally Identified RNA Binding Proteins in Oryza sativa. Front. Plant Sci. 3:90. doi: 10.3389/fpls.2012.00090
4. Fan, J., Mohareb, F., Jones, A. M., and Bessant, C. (2012). MRMaid: The SRM assay design tool for Arabidopsis and other species. Front. Plant Sci. 3:164. doi: 10.3389/fpls.2012.00164
5. Galland, M., Job, D., and Rajjou, L. (2012). The seed proteome web portal. Front. Plant Sci. 3:98. doi: 10.3389/fpls.2012.00098
6. Heazlewood, J. L. (2011). The green proteome: challenges in plant proteomics. Front. Plant Sci. 2:6. doi: 10.3389/fpls.2011.00006
7. Heazlewood, J. L., and Millar, A. H. (2005). AMPDB: the Arabidopsis mitochondrial protein database. Nucleic Acids Res. 33, D605–D610. doi: 10.1093/nar/gki048
8. Helmy, M., Sugiyama, N., Tomita, M., and Ishihama, Y. (2012). The rice proteogenomics database OryzaPG-DB: development, expansion, and new features. Front. Plant Sci. 3:65. doi: 10.3389/fpls.2012.00065
9. Hirsch-Hoffmann, M., Gruissem, W., and Baerenfaller, K. (2012). pep2pro: the high-throughput proteomics data processing, analysis, and visualization tool. Front. Plant Sci. 3:123. doi: 10.3389/fpls.2012.00123
10. Klubicova, K., Vesel, M., Rashydov, N. M., and Hajduch, M. (2012). Seeds in Chernobyl: the database on proteome response on radioactive environment. Front. Plant Sci. 3:231. doi: 10.3389/fpls.2012.00231
11. Mann, G. W., Calley, P. C., Joshi, H. J., and Heazlewood, J. L. (2013). MASCP Gator: an overview of the Arabidopsis proteomic aggregation portal. Front. Plant Sci. 4:411. doi: 10.3389/fpls.2013.00411
12. Ohyanagi, H., Sakata, K., and Komatsu, S. (2012). Soybean Proteome Database 2012: update on the comprehensive data repository for soybean proteomics. Front. Plant Sci. 3:110. doi: 10.3389/fpls.2012.00110
13. Reumann, S., Buchwald, D., and Lingner, T. (2012). PredPlantPTS1: A web server for the prediction of plant peroxisomal proteins. Front. Plant Sci. 3:194. doi: 10.3389/fpls.2012.00194
14. Reumann, S., Ma, C., Lemke, S., and Babujee, L. (2004). AraPerox. A database of putative Arabidopsis proteins from plant peroxisomes. Plant Physiol. 136, 2587–2608. doi: 10.1104/pp.104.043695
15. Rose, C. M., Venkateshwaran, M., Grimsrud, P. A., Westphall, M. S., Sussman, M. R., Coon, J. J., et al. (2012). Medicago phosphoprotein database: a repository for Medicago truncatula phosphoprotein data. Front. Plant Sci. 3:122. doi: 10.3389/fpls.2012.00122
16. Senkler, M., and Braun, H. P. (2012). Functional annotation of 2D protein maps: the GelMap portal. Front. Plant Sci. 3:87. doi: 10.3389/fpls.2012.00087
17. Sun, Q., Zybailov, B., Majeran, W., Friso, G., Olinares, P. D., and Van Wijk, K. J. (2009). PPDB, The plant proteomics database at Cornell. Nucleic Acids Res. 37, D969–D974. doi: 10.1093/nar/gkn654
18. Tanz, S. K., Castleden, I., Hooper, C. M., Vacher, M., Small, I., and Millar, H. A. (2013). SUBA3: a database for integrating experimentation and prediction to define the SUBcellular location of proteins in Arabidopsis. Nucleic Acids Res. 41, D1185–D1191. doi: 10.1093/nar/gks1151
19. Usadel, B., Schwacke, R., Nagel, A., and Kersten, B. (2012). GabiPD - the GABI primary database integrates plant proteomic data with gene-centric information. Front. Plant Sci. 3:154. doi: 10.3389/fpls.2012.00154
20. Vizcaino, J. A., Cote, R. G., Csordas, A., Dianes, J. A., Fabregat, A., Foster, J. M., et al. (2013). The Proteomics Identifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 41, D1063–D1069. doi: 10.1093/nar/gks1262
21. Wienkoop, S., Staudinger, C., Hoehenwarter, W., Weckwerth, W., and Egelhofer, V. (2012). ProMEX - a mass spectral reference database for plant proteomics. Front. Plant Sci. 3:125. doi: 10.3389/fpls.2012.00125
22. Yao, Q., Gao, J., Bollinger, C., Thelen, J. J., and Xu, D. (2012). Predicting and analyzing protein phosphorylation sites in plants using Musite. Front. Plant Sci. 3:186. doi: 10.3389/fpls.2012.00186

Summary

Keywords

proteomics, informatics, database, phosphorylation, proteogenomic, subcellular

Citation

Carroll AW, Joshi HJ and Heazlewood JL (2013) Managing the green proteomes for the next decade of plant research. Front. Plant Sci. 4:501. doi: 10.3389/fpls.2013.00501

Received: 20 November 2013; Accepted: 22 November 2013; Published: 16 December 2013.

Edited by

Richard A. Jorgensen, Carnegie Institution for Science, USA

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: jlheazlewood@lbl.gov

This article was submitted to Plant Proteomics, a section of the journal Frontiers in Plant Science.

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
