How to talk about protein-level false discovery rates in shotgun proteomics

Cited by: 36
Authors
The, Matthew [1]
Tasnim, Ayesha [1]
Kall, Lukas [1]
Affiliations
[1] Royal Inst Technol KTH, Sch Biotechnol, Sci Life Lab, Box 1031, S-17121 Solna, Sweden
Keywords
Bioinformatics; Data processing and analysis; Mass spectrometry-LC-MS/MS; Protein inference; Simulation; Statistical analysis; Tandem mass spectrometry; Statistical significance; Inference problem; Probabilities
DOI
10.1002/pmic.201500431
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
A frequently sought output from a shotgun proteomics experiment is a list of proteins that we believe to have been present in the analyzed sample before proteolytic digestion. The standard technique to control for errors in such lists is to enforce a preset threshold for the false discovery rate (FDR). Many consider protein-level FDRs a difficult and vague concept, as the measurement entities, spectra, are manifestations of peptides and not proteins. Here, we argue that this confusion is unnecessary and provide a framework for thinking about protein-level FDRs, starting from their basic principle: the null hypothesis. Specifically, we point out that two competing null hypotheses are used concurrently in today's protein inference methods, which has gone unnoticed by many. Using simulations of a shotgun proteomics experiment, we show how confusing one null hypothesis for the other can lead to serious discrepancies in the FDR. Furthermore, we demonstrate how the same simulations can be used to verify FDR estimates of protein inference methods. In particular, we show that, for a simple protein inference method, decoy models can be used to accurately estimate protein-level FDRs for both competing null hypotheses.
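As a rough illustration of the decoy idea mentioned in the abstract, the sketch below estimates a protein-level FDR by counting decoy proteins that score at least as well as each target protein. This is the generic target-decoy estimate, not the authors' simulation or their protein inference method, and it does not distinguish between the two null hypotheses the abstract refers to. The function names and the `(score, is_decoy)` input format are assumptions made for the example.

```python
from typing import List, Tuple


def decoy_fdr_estimates(proteins: List[Tuple[float, bool]]) -> List[Tuple[float, float]]:
    """Return (score, estimated FDR) for each target protein, best score first.

    proteins: hypothetical (score, is_decoy) pairs; a higher score is assumed
    to mean stronger evidence. Ties in score are not treated specially.
    """
    ranked = sorted(proteins, key=lambda p: p[0], reverse=True)
    targets = 0
    decoys = 0
    estimates = []
    for score, is_decoy in ranked:
        if is_decoy:
            decoys += 1
        else:
            targets += 1
            # Decoys above this cutoff stand in for false target discoveries.
            estimates.append((score, min(1.0, decoys / targets)))
    return estimates


def targets_accepted(proteins: List[Tuple[float, bool]], fdr_threshold: float) -> int:
    """Largest number of top-ranked targets whose estimated FDR is within the threshold."""
    accepted = 0
    for rank, (_, fdr) in enumerate(decoy_fdr_estimates(proteins), start=1):
        if fdr <= fdr_threshold:
            accepted = rank
    return accepted
```

Calling `targets_accepted(proteins, 0.01)` on a scored list would give the length of the protein list reported at a 1% estimated FDR. Whether that estimate is accurate depends on which null hypothesis the scores and decoys actually model, which is the question the abstract raises.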
Pages: 2461-2469
Number of pages: 9