How to talk about protein-level false discovery rates in shotgun proteomics

被引:36
|
作者
The, Matthew [1 ]
Tasnim, Ayesha [1 ]
Kall, Lukas [1 ]
机构
[1] Royal Inst Technol KTH, Sch Biotechnol, Sci Life Lab, Box 1031, S-17121 Solna, Sweden
关键词
Bioinformatics; Data processing and analysis; Mass spectrometry-LC-MS/MS; Protein inference; Simulation; Statistical analysis; TANDEM MASS-SPECTROMETRY; STATISTICAL SIGNIFICANCE; INFERENCE PROBLEM; PROBABILITIES;
D O I
10.1002/pmic.201500431
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A frequently sought output from a shotgun proteomics experiment is a list of proteins that we believe to have been present in the analyzed sample before proteolytic digestion. The standard technique to control for errors in such lists is to enforce a preset threshold for the false discovery rate (FDR). Many consider protein-level FDRs a difficult and vague concept, as the measurement entities, spectra, are manifestations of peptides and not proteins. Here, we argue that this confusion is unnecessary and provide a framework on how to think about protein-level FDRs, starting from its basic principle: the null hypothesis. Specifically, we point out that two competing null hypotheses are used concurrently in today's protein inference methods, which has gone unnoticed by many. Using simulations of a shotgun proteomics experiment, we show how confusing one null hypothesis for the other can lead to serious discrepancies in the FDR. Furthermore, we demonstrate how the same simulations can be used to verify FDR estimates of protein inference methods. In particular, we show that, for a simple protein inference method, decoy models can be used to accurately estimate protein-level FDRs for both competing null hypotheses.
引用
收藏
页码:2461 / 2469
页数:9
相关论文
共 38 条
  • [1] Decoy Methods for Assessing False Positives and False Discovery Rates in Shotgun Proteomics
    Wang, Guanghui
    Wu, Wells W.
    Zhang, Zheng
    Masilamani, Shyama
    Shen, Rong-Fong
    ANALYTICAL CHEMISTRY, 2009, 81 (01) : 146 - 159
  • [2] A new estimation of protein-level false discovery rate
    Wu, Guanying
    Wan, Xiang
    Xu, Baohua
    BMC GENOMICS, 2018, 19
  • [3] A new estimation of protein-level false discovery rate
    Guanying Wu
    Xiang Wan
    Baohua Xu
    BMC Genomics, 19
  • [4] Decoy-free protein-level false discovery rate estimation
    Teng, Ben
    Huang, Ting
    He, Zengyou
    BIOINFORMATICS, 2014, 30 (05) : 675 - 681
  • [5] Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics
    Keich, Uri
    Kertesz-Farkas, Attila
    Noble, William Stafford
    JOURNAL OF PROTEOME RESEARCH, 2015, 14 (08) : 3148 - 3161
  • [6] Empirical approach to false discovery rate estimation in shotgun proteomics
    Goloborodko, Anton A.
    Mayerhofer, Corina
    Zubarev, Alexander R.
    Tarasova, Irina A.
    Gorshkov, Alexander V.
    Zubarev, Roman A.
    Gorshkov, Mikhail V.
    RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2010, 24 (04) : 454 - 462
  • [7] Common Decoy Distributions Simplify False Discovery Rate Estimation in Shotgun Proteomics
    Madej, Dominik
    Wu, Long
    Lam, Henry
    JOURNAL OF PROTEOME RESEARCH, 2022, 21 (02) : 339 - 348
  • [8] Unbiased False Discovery Rate Estimation for Shotgun Proteomics Based on the Target-Decoy Approach
    Levitsky, Lev I.
    Ivanov, Mark V.
    Lobas, Anna A.
    Gorshkov, Mikhail V.
    JOURNAL OF PROTEOME RESEARCH, 2017, 16 (02) : 393 - 397
  • [9] ProteoStats-a library for estimating false discovery rates in proteomics pipelines
    Yadav, Amit Kumar
    Kadimi, Puneet Kumar
    Kumar, Dhirendra
    Dash, Debasis
    BIOINFORMATICS, 2013, 29 (21) : 2799 - 2800
  • [10] Bayesian false discovery rates for post-translational modification proteomics
    Fu, Yan
    STATISTICS AND ITS INTERFACE, 2012, 5 (01) : 47 - 59