How to talk about protein-level false discovery rates in shotgun proteomics

被引:36
|
作者
The, Matthew [1 ]
Tasnim, Ayesha [1 ]
Kall, Lukas [1 ]
机构
[1] Royal Inst Technol KTH, Sch Biotechnol, Sci Life Lab, Box 1031, S-17121 Solna, Sweden
关键词
Bioinformatics; Data processing and analysis; Mass spectrometry-LC-MS/MS; Protein inference; Simulation; Statistical analysis; TANDEM MASS-SPECTROMETRY; STATISTICAL SIGNIFICANCE; INFERENCE PROBLEM; PROBABILITIES;
D O I
10.1002/pmic.201500431
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A frequently sought output from a shotgun proteomics experiment is a list of proteins that we believe to have been present in the analyzed sample before proteolytic digestion. The standard technique to control for errors in such lists is to enforce a preset threshold for the false discovery rate (FDR). Many consider protein-level FDRs a difficult and vague concept, as the measurement entities, spectra, are manifestations of peptides and not proteins. Here, we argue that this confusion is unnecessary and provide a framework on how to think about protein-level FDRs, starting from its basic principle: the null hypothesis. Specifically, we point out that two competing null hypotheses are used concurrently in today's protein inference methods, which has gone unnoticed by many. Using simulations of a shotgun proteomics experiment, we show how confusing one null hypothesis for the other can lead to serious discrepancies in the FDR. Furthermore, we demonstrate how the same simulations can be used to verify FDR estimates of protein inference methods. In particular, we show that, for a simple protein inference method, decoy models can be used to accurately estimate protein-level FDRs for both competing null hypotheses.
引用
收藏
页码:2461 / 2469
页数:9
相关论文
共 38 条
  • [11] Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics (vol 14, pg 3148, 2015)
    Keich, Uri
    Kertesz-Farkas, Attila
    Noble, William Stafford
    JOURNAL OF PROTEOME RESEARCH, 2016, 15 (12) : 4779 - 4780
  • [12] Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0
    The, Matthew
    MacCoss, Michael J.
    Noble, William S.
    Kall, Lukas
    JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2016, 27 (11) : 1719 - 1727
  • [13] Protein Identification False Discovery Rates for Very Large Proteomics Data Sets Generated by Tandem Mass Spectrometry
    Reiter, Lukas
    Claassen, Manfred
    Schrimpf, Sabine P.
    Jovanovic, Marko
    Schmidt, Alexander
    Buhmann, Joachim M.
    Hengartner, Michael O.
    Aebersold, Ruedi
    MOLECULAR & CELLULAR PROTEOMICS, 2009, 8 (11) : 2405 - 2417
  • [14] Comparison of a Protein-Level and Peptide-Level Labeling Strategy for Quantitative Proteomics of Synaptosomes Using Isobaric Tags
    Engmann, Olivia
    Campbell, James
    Ward, Malcolm
    Giese, K. Peter
    Thompson, Andrew J.
    JOURNAL OF PROTEOME RESEARCH, 2010, 9 (05) : 2725 - 2733
  • [15] False discovery rates and related statistical concepts in mass spectrometry-based proteomics
    Choi, Hyungwon
    Nesvizhskii, Alexey I.
    JOURNAL OF PROTEOME RESEARCH, 2008, 7 (01) : 47 - 50
  • [16] Improved results in proteomics by use of local and peptide-class specific false discovery rates
    Lau Sennels
    Jimi-Carlo Bukowski-Wills
    Juri Rappsilber
    BMC Bioinformatics, 10
  • [17] Improved results in proteomics by use of local and peptide-class specific false discovery rates
    Sennels, Lau
    Bukowski-Wills, Jimi-Carlo
    Rappsilber, Juri
    BMC BIOINFORMATICS, 2009, 10
  • [18] Accurate Estimation of Context-Dependent False Discovery Rates in Top-Down Proteomics
    LeDuc, Richard D.
    Fellers, Ryan T.
    Early, Bryan P.
    Greer, Joseph B.
    Shams, Daniel P.
    Thomas, Paul M.
    Kelleher, Neil L.
    MOLECULAR & CELLULAR PROTEOMICS, 2019, 18 (04) : 796 - 805
  • [19] Quantitative Top-Down Proteomics in Complex Samples Using Protein-Level Tandem Mass Tag Labeling
    Yu, Dahang
    Wang, Zhe
    Cupp-Sutton, Kellye A.
    Guo, Yanting
    Kou, Qiang
    Smith, Kenneth
    Liu, Xiaowen
    Wu, Si
    JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2021, 32 (06) : 1336 - 1344
  • [20] Optimization of protein-level tandem mass tag (TMT) labeling conditions in complex samples with top-down proteomics
    Guo, Yanting
    Yu, Dahang
    Cupp-Sutton, Kellye A.
    Liu, Xiaowen
    Wu, Si
    Analytica Chimica Acta, 2022, 1221