Integrated approach for manual evaluation of peptides identified by searching protein sequence databases with tandem mass spectra

Cited by: 161
Authors
Chen, Y [1]
Kwon, SW [1]
Kim, SC [1]
Zhao, YM [1]
Affiliations
[1] Univ Texas, SW Med Ctr, Dept Biochem, Dallas, TX 75390 USA
Keywords
protein identification; manual evaluation; automated database search
DOI
10.1021/pr049754t
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Quantitative proteomics relies on accurate protein identification, which is often carried out by automated searching of a sequence database with tandem mass spectra of peptides. When these spectra contain limited information, automated searches may lead to incorrect peptide identifications. It is therefore necessary to validate the identifications by careful manual inspection of the mass spectra. Not only is this task time-consuming, but the reliability of the validation varies with the experience of the analyst. Here, we report a systematic approach to evaluating peptide identifications made by automated search algorithms. The method is based on the principle that the candidate peptide sequence should adequately explain the observed fragment ions. Also, the mass errors of neighboring fragments should be similar. To evaluate our method, we studied tandem mass spectra obtained from tryptic digests of E. coli and HeLa cells. Candidate peptides were identified with the automated search engine Mascot and subjected to the manual validation method. The method found correct peptide identifications that were given low Mascot scores (e.g., 20-25) and incorrect peptide identifications that were given high Mascot scores (e.g., 40-50). The method comprehensively detected false results from searches designed to produce incorrect identifications. Comparison of the tandem mass spectra of synthetic candidate peptides to the spectra obtained from the complex peptide mixtures confirmed the accuracy of the evaluation method. Thus, the evaluation approach described here could help boost the accuracy of protein identification, increase the number of peptides identified, and provide a step toward developing a more accurate next-generation algorithm for protein identification.
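The two principles stated in the abstract (the candidate sequence should explain the observed fragment ions, and neighboring fragments should show similar mass errors) can be illustrated with a minimal Python sketch. This is not the authors' procedure: the function names, the 0.5 Da matching tolerance, and the 0.3 Da limit on error differences between neighboring ions are all hypothetical choices made for illustration, and only singly charged b- and y-ions of unmodified peptides are considered.

```python
# Monoisotopic residue masses (Da) of the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
PROTON = 1.007276   # mass of a proton (Da)
WATER = 18.010565   # monoisotopic mass of H2O (Da)


def fragment_ions(peptide):
    """Return singly charged b- and y-ion m/z values as {label: mz}."""
    masses = [RESIDUE_MASS[aa] for aa in peptide]
    ions = {}
    for i in range(1, len(peptide)):
        ions[f"b{i}"] = sum(masses[:i]) + PROTON
        ions[f"y{i}"] = sum(masses[-i:]) + WATER + PROTON
    return ions


def match_peaks(peptide, observed_mz, tol=0.5):
    """Pair each theoretical ion with the closest observed peak.

    Returns {label: signed mass error in Da} for ions matched within
    `tol` (a hypothetical tolerance, not a value from the paper).
    """
    matches = {}
    if not observed_mz:
        return matches
    for label, mz in fragment_ions(peptide).items():
        closest = min(observed_mz, key=lambda peak: abs(peak - mz))
        if abs(closest - mz) <= tol:
            matches[label] = closest - mz
    return matches


def evaluate(peptide, observed_mz, tol=0.5, max_error_jump=0.3):
    """Return (coverage, errors_consistent) for a candidate peptide.

    coverage: fraction of theoretical b/y ions that found a peak.
    errors_consistent: True if neighboring matched ions of the same
    series differ in mass error by at most `max_error_jump`
    (hypothetical threshold).
    """
    matches = match_peaks(peptide, observed_mz, tol)
    coverage = len(matches) / (2 * (len(peptide) - 1))
    consistent = True
    for series in "by":
        for i in range(1, len(peptide) - 1):
            a, b = f"{series}{i}", f"{series}{i + 1}"
            if a in matches and b in matches:
                if abs(matches[a] - matches[b]) > max_error_jump:
                    consistent = False
    return coverage, consistent
```

A candidate with high coverage but erratic mass errors would be flagged for closer inspection under these assumptions, whereas a uniform offset across neighboring ions (as a calibration drift would produce) passes the consistency check.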
Pages: 998-1005
Page count: 8