How to talk about protein-level false discovery rates in shotgun proteomics

被引：36

作者：

The, Matthew ^{[1
]}

Tasnim, Ayesha ^{[1
]}

Kall, Lukas ^{[1
]}

机构：

[1] Royal Inst Technol KTH, Sch Biotechnol, Sci Life Lab, Box 1031, S-17121 Solna, Sweden

来源：

PROTEOMICS | 2016年 / 16卷 / 18期

关键词：

Bioinformatics; Data processing and analysis; Mass spectrometry-LC-MS/MS; Protein inference; Simulation; Statistical analysis; TANDEM MASS-SPECTROMETRY; STATISTICAL SIGNIFICANCE; INFERENCE PROBLEM; PROBABILITIES;

D O I：

10.1002/pmic.201500431

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

A frequently sought output from a shotgun proteomics experiment is a list of proteins that we believe to have been present in the analyzed sample before proteolytic digestion. The standard technique to control for errors in such lists is to enforce a preset threshold for the false discovery rate (FDR). Many consider protein-level FDRs a difficult and vague concept, as the measurement entities, spectra, are manifestations of peptides and not proteins. Here, we argue that this confusion is unnecessary and provide a framework on how to think about protein-level FDRs, starting from its basic principle: the null hypothesis. Specifically, we point out that two competing null hypotheses are used concurrently in today's protein inference methods, which has gone unnoticed by many. Using simulations of a shotgun proteomics experiment, we show how confusing one null hypothesis for the other can lead to serious discrepancies in the FDR. Furthermore, we demonstrate how the same simulations can be used to verify FDR estimates of protein inference methods. In particular, we show that, for a simple protein inference method, decoy models can be used to accurately estimate protein-level FDRs for both competing null hypotheses.

引用

页码：2461 / 2469

页数：9

共 38 条

[21] An assessment of false discovery rates and statistical significance in label-free quantitative proteomics with combined filters
Li, Qingbo
Roxas, Bryan A. P.
BMC BIOINFORMATICS, 2009, 10
[22] An assessment of false discovery rates and statistical significance in label-free quantitative proteomics with combined filters
Qingbo Li
Bryan AP Roxas
BMC Bioinformatics, 10
[23] A benchmarking protocol for intact protein-level Tandem Mass Tag (TMT) labeling for quantitative top-down proteomics
Guo, Yanting
Yu, Dahang
Cupp-Sutton, Kellye A.
Liu, Xiaowen
Wu, Si
METHODSX, 2022, 9
[24] Optimization of protein-level tandem mass tag (TMT) labeling conditions in complex samples with top-down proteomics
Guo, Yanting
Yu, Dahang
Cupp-Sutton, Kellye A.
Liu, Xiaowen
Wu, Si
ANALYTICA CHIMICA ACTA, 2022, 1221
[25] Estimating false discovery rates for peptide and protein identification using randomized databases
Hather, Gregory
Higdon, Roger
Bauman, Andrew
von Haller, Priska D.
Kolker, Eugene
PROTEOMICS, 2010, 10 (12) : 2369 - 2376
[26] QPROT: Statistical method for testing differential expression using protein-level intensity data in label-free quantitative proteomics
Choi, Hyungwon
Kim, Sinae
Fermin, Damian
Tsou, Chih-Chiang
Nesvizhskii, Alexey I.
JOURNAL OF PROTEOMICS, 2015, 129 : 121 - 126
[27] Shotgun proteomics aids discovery of novel protein-coding genes, alternative splicing, and "resurrected" pseudogenes in the mouse genome
Brosch, Markus
Saunders, Gary I.
Frankish, Adam
Collins, Mark O.
Yu, Lu
Wright, James
Verstraten, Ruth
Adams, David J.
Harrow, Jennifer
Choudhary, Jyoti S.
Hubbard, Tim
GENOME RESEARCH, 2011, 21 (05) : 756 - 767
[28] IPM: An integrated protein model for false discovery rate estimation and identification in high-throughput proteomics
Higdon, Roger
Reiter, Lukas
Hather, Gregory
Haynes, Winston
Kolker, Natali
Stewart, Elizabeth
Bauman, Andrew T.
Picotti, Paola
Schmidt, Alexander
van Belle, Gerald
Aebersold, Ruedi
Kolker, Eugene
JOURNAL OF PROTEOMICS, 2011, 75 (01) : 116 - 121
[29] False Discovery Rates of Protein Identifications: A Strike against the Two-Peptide Rule
Gupta, Nitin
Pevzner, Pavel A.
JOURNAL OF PROTEOME RESEARCH, 2009, 8 (09) : 4173 - 4181
[30] How to Train a Postprocessor for Tandem Mass Spectrometry Proteomics Database Search While Maintaining Control of the False Discovery Rate
Freestone, Jack
Kall, Lukas
Noble, William Stafford
Keich, Uri
JOURNAL OF PROTEOME RESEARCH, 2025,

← 1 2 3 4 →