ProMEX: a mass spectral reference database for proteins and protein phosphorylation sites

被引:67
|
作者
Hummel, Jan
Niemann, Michaela
Wienkoop, Stefanie
Schulze, Waltraud
Steinhauser, Dirk
Selbig, Joachim
Walther, Dirk
Weckwerth, Wolfram
机构
[1] Max Planck Inst Mol Plant Physiol, D-14424 Potsdam, Germany
[2] Univ Potsdam, MPIMP, Inst Biochem & Biol, D-14424 Potsdam, Germany
关键词
D O I
10.1186/1471-2105-8-216
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: In the last decade, techniques were established for the large scale genome-wide analysis of proteins, RNA, and metabolites, and database solutions have been developed to manage the generated data sets. The Golm Metabolome Database for metabolite data (GMD) represents one such effort to make these data broadly available and to interconnect the different molecular levels of a biological system [1]. As data interpretation in the light of already existing data becomes increasingly important, these initiatives are an essential part of current and future systems biology. Results: A mass spectral library consisting of experimentally derived tryptic peptide product ion spectra was generated based on liquid chromatography coupled to ion trap mass spectrometry ( LC-IT-MS). Protein samples derived from Arabidopsis thaliana, Chlamydomonas reinhardii, Medicago truncatula, and Sinorhizobium meliloti were analysed. With currently 4,557 manually validated spectra associated with 4,226 unique peptides from 1,367 proteins, the database serves as a continuously growing reference data set and can be used for protein identification and quantification in uncharacterized biological samples. For peptide identification, several algorithms were implemented based on a recently published study for peptide mass fingerprinting [2] and tested for false positive and negative rates. An algorithm which considers intensity distribution for match correlation scores was found to yield best results. For proof of concept, an LC-IT-MS analysis of a tryptic leaf protein digest was converted to mzData format and searched against the mass spectral library. The utility of the mass spectral library was also tested for the identification of phosphorylated tryptic peptides. We included in vivo phosphorylation sites of Arabidopsis thaliana proteins and the identification performance was found to be improved compared to genome-based search algorithms. Protein identification by ProMEX is linked to other levels of biological organization such as metabolite, pathway, and transcript data. The database is further connected to annotation and classification services via BioMoby. Conclusion: The ProMEX protein/ peptide database represents a mass spectral reference library with the capability of matching unknown samples for protein identification. The database allows text searches based on metadata such as experimental information of the samples, mass spectrometric instrument parameters or unique protein identifier like AGI codes. ProMEX integrates proteomics data with other levels of molecular organization including metabolite, pathway, and transcript information and may thus become a useful resource for plant systems biology studies. The ProMEX mass spectral library is available at http://promex.mpimp-golm.mpg.de/.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] ProMEX: a mass spectral reference database for proteins and protein phosphorylation sites
    Jan Hummel
    Michaela Niemann
    Stefanie Wienkoop
    Waltraud Schulze
    Dirk Steinhauser
    Joachim Selbig
    Dirk Walther
    Wolfram Weckwerth
    BMC Bioinformatics, 8
  • [2] ProMEX - a mass spectral reference database for plant proteomics
    Wienkoop, Stefanie
    Staudinger, Christiana
    Hoehenwarter, Wolfgang
    Weckwerth, Wolfram
    Egelhofer, Volker
    FRONTIERS IN PLANT SCIENCE, 2012, 3
  • [3] Mapping phosphorylation sites in proteins by mass spectrometry
    Shou, WY
    Verma, R
    Annan, RS
    Huddleston, MJ
    Chen, SL
    Carr, SA
    Deshaies, RJ
    GUIDE TO YEAST GENETICS AND MOLECULAR AND CELL BIOLOGY, PT C, 2002, 351 : 279 - 296
  • [4] dbPSP: a curated database for protein phosphorylation sites in prokaryotes
    Pan, Zhicheng
    Wang, Bangshan
    Zhang, Ying
    Wang, Yongbo
    Ullah, Shahid
    Jian, Ren
    Liu, Zexian
    Xue, Yu
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2015,
  • [5] PhosphoPep—a database of protein phosphorylation sites in model organisms
    Bernd Bodenmiller
    David Campbell
    Bertran Gerrits
    Henry Lam
    Marko Jovanovic
    Paola Picotti
    Ralph Schlapbach
    Ruedi Aebersold
    Nature Biotechnology, 2008, 26 : 1339 - 1340
  • [6] PhosphoPep-a database of protein phosphorylation sites in model organisms
    Bodenmiller, Bernd
    Campbell, David
    Gerrits, Bertran
    Lam, Henry
    Jovanovic, Marko
    Picotti, Paola
    Schlapbach, Ralph
    Aebersold, Ruedi
    NATURE BIOTECHNOLOGY, 2008, 26 (12) : 1339 - 1340
  • [7] dbPSP 2.0, an updated database of protein phosphorylation sites in prokaryotes
    Shi, Ying
    Zhang, Ying
    Lin, Shaofeng
    Wang, Chenwei
    Zhou, Jiaqi
    Peng, Di
    Xue, Yu
    SCIENTIFIC DATA, 2020, 7 (01)
  • [8] dbPSP 2.0, an updated database of protein phosphorylation sites in prokaryotes
    Ying Shi
    Ying Zhang
    Shaofeng Lin
    Chenwei Wang
    Jiaqi Zhou
    Di Peng
    Yu Xue
    Scientific Data, 7
  • [9] Identification of phosphorylation sites on neurofilament proteins by nanoelectrospray mass spectrometry
    Betts, JC
    Blackstock, WP
    Ward, MA
    Anderton, BH
    JOURNAL OF BIOLOGICAL CHEMISTRY, 1997, 272 (20) : 12922 - 12927
  • [10] PhosphoBase: a database of phosphorylation sites
    Blom, N
    Kreegipuu, A
    Brunak, S
    NUCLEIC ACIDS RESEARCH, 1998, 26 (01) : 382 - 386