PhoStar: Identifying Tandem Mass Spectra of Phosphorylated Peptides before Database Search

被引:8
|
作者
Dorl, Sebastian [1 ]
Winkler, Stephan [1 ]
Mechtler, Karl [2 ,3 ]
Dorfer, Viktoria [1 ]
机构
[1] Univ Appl Sci Upper Austria, Bioinformat Res Grp, Softwarepk 11, D-4232 Hagenberg, Germany
[2] Res Inst Mol Pathol IMP, Prot Chem, Campus Vienna Bioctr 1, A-1030 Vienna, Austria
[3] Inst Mol Biotechnol IMBA, Prot Chem, Chem Vienna Bioctr VBC, Dr Bohr Gasse 3, A-1030 Vienna, Austria
基金
奥地利科学基金会;
关键词
mass spectrometry; proteomics; post-translational modification; phosphorylation; search space reduction; machine learning random forest classification; POSTTRANSLATIONAL MODIFICATIONS; PROTEIN IDENTIFICATION; SPECTROMETRY; ALGORITHM; TOOL; PREDICTION;
D O I
10.1021/acs.jproteome.7b00563
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Standard proteomics workflows use tandem mass spectrometry followed by sequence database search to analyze complex biological samples. The identification of proteins carrying post-translational modifications, for example, phosphorylation, is typically addressed by allowing variable modifications in the searched sequences. Accounting for these variations exponentially increases the combinatorial space in the database, which leads to increased processing times and more false positive identifications. The here-presented tool PhoStar identifies spectra that originate from phosphorylated peptides before database search using a supervised machine learning approach. The model for the prediction of phosphorylation was trained and validated with an accuracy of 97.6% on a large set of high-confidence spectra collected from publicly available experimental data. Its power was further validated by predicting phosphorylation in the complete NIST human and mouse high collision dissociation spectral libraries, achieving an accuracy of 98.2 and 97.9%, respectively. We demonstrate the application of PhoStar by using it for spectra filtering before database search. In database search of HeLa samples the peptide search space was reduced by 27-66% while finding at least 97% of total peptide identifications (at 1% FDR) compared with a standard workflow.
引用
收藏
页码:290 / 295
页数:6
相关论文
共 50 条
  • [21] Origin of Disagreements in Tandem Mass Spectra Interpretation by Search Engines
    Tessier, Dominique
    Lollier, Virginie
    Larre, Colette
    Rogniaux, Helene
    JOURNAL OF PROTEOME RESEARCH, 2016, 15 (10) : 3481 - 3488
  • [22] Prediction of Novel Modifications by Unrestrictive Search of Tandem Mass Spectra
    Na, Seungjin
    Paek, Eunok
    JOURNAL OF PROTEOME RESEARCH, 2009, 8 (10) : 4418 - 4427
  • [23] XLSearch: a Probabilistic Database Search Algorithm for Identifying Cross-Linked Peptides
    Ji, Chao
    Li, Sujun
    Reilly, James P.
    Radivojac, Predrag
    Tang, Haixu
    JOURNAL OF PROTEOME RESEARCH, 2016, 15 (06) : 1830 - 1841
  • [24] Protein Inference by Assembling Peptides Identified from Tandem Mass Spectra
    Shi, Jinhong
    Wu, Fang-Xiang
    CURRENT BIOINFORMATICS, 2009, 4 (03) : 226 - 233
  • [25] Impact of ion trap tandem mass spectra variability on the identification of peptides
    Venable, JD
    Yates, JR
    ANALYTICAL CHEMISTRY, 2004, 76 (10) : 2928 - 2937
  • [26] The Interpretation of Collision-Induced Dissociation Tandem Mass Spectra of Peptides
    Papayannopoulos, I. A.
    Mass Spectrometry Reviews, 14 (01):
  • [27] Interpretation of Tandem Mass Spectra Obtained from Cyclic Nonribosomal Peptides
    Liu, Wei-Ting
    Ng, Julio
    Meluzzi, Dario
    Bandeira, Nuno
    Gutierrez, Marcelino
    Simmons, Thomas L.
    Schultz, Andrew W.
    Linington, Roger G.
    Moore, Bradley S.
    Gerwick, William H.
    Pevzner, Pavel A.
    Dorrestein, Pieter C.
    ANALYTICAL CHEMISTRY, 2009, 81 (11) : 4200 - 4209
  • [28] Dereplication of microbial metabolites through database search of mass spectra
    Mohimani, Hosein
    Gurevich, Alexey
    Shlemov, Alexander
    Mikheenko, Alla
    Korobeynikov, Anton
    Cao, Liu
    Shcherbin, Egor
    Nothias, Louis-Felix
    Dorrestein, Pieter C.
    Pevzner, Pavel A.
    NATURE COMMUNICATIONS, 2018, 9
  • [29] Dereplication of microbial metabolites through database search of mass spectra
    Hosein Mohimani
    Alexey Gurevich
    Alexander Shlemov
    Alla Mikheenko
    Anton Korobeynikov
    Liu Cao
    Egor Shcherbin
    Louis-Felix Nothias
    Pieter C. Dorrestein
    Pavel A. Pevzner
    Nature Communications, 9
  • [30] InsPecT: Identification of posttransiationally modified peptides from tandem mass spectra
    Tanner, S
    Shu, HJ
    Frank, A
    Wang, LC
    Zandi, E
    Mumby, M
    Pevzner, PA
    Bafna, V
    ANALYTICAL CHEMISTRY, 2005, 77 (14) : 4626 - 4639