Accurate Estimation of Context-Dependent False Discovery Rates in Top-Down Proteomics

被引:29
|
作者
LeDuc, Richard D. [1 ]
Fellers, Ryan T. [1 ]
Early, Bryan P. [1 ,2 ]
Greer, Joseph B. [1 ]
Shams, Daniel P. [3 ]
Thomas, Paul M. [1 ,2 ]
Kelleher, Neil L. [1 ,2 ,4 ,5 ]
机构
[1] Northwestern Univ, Evanston, IL 60208 USA
[2] Northwestern Univ, Dept Mol Biosci, Evanston, IL 60208 USA
[3] Northwestern Univ, Interdisciplinary Biol Sci, Evanston, IL USA
[4] Northwestern Univ, Dept Chem, Evanston, IL 60208 USA
[5] Northwestern Univ, Feinberg Sch Med, Evanston, IL 60208 USA
基金
美国国家卫生研究院;
关键词
PROTEIN IDENTIFICATION; MASS; PROTEOFORM; PEPTIDE;
D O I
10.1074/mcp.RA118.000993
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Within the last several years, top-down proteomics has emerged as a high throughput technique for protein and proteoform identification. This technique has the potential to identify and characterize thousands of proteoforms within a single study, but the absence of accurate false discovery rate (FDR) estimation could hinder the adoption and consistency of top-down proteomics in the future. In automated identification and characterization of proteoforms, FDR calculation strongly depends on the context of the search. The context includes MS data quality, the database being interrogated, the search engine, and the parameters of the search. Particular to top-down proteomics-there are four molecular levels of study: proteoform spectral match (PrSM), protein, isoform, and proteoform. Here, a context-dependent framework for calculating an accurate FDR at each level was designed, implemented, and validated against a manually curated training set with 546 confirmed proteoforms. We examined several search contexts and found that an FDR calculated at the PrSM level under-reported the true FDR at the protein level by an average of 24-fold. We present a new open-source tool, the TDCD_FDR_Calculator, which provides a scalable, context-dependent FDR calculation that can be applied post-search to enhance the quality of results in top-down proteomics from any search engine.
引用
收藏
页码:796 / 805
页数:10
相关论文
共 50 条
  • [1] Precursor deconvolution error estimation: The missing puzzle piece in false discovery rate in top-down proteomics
    Jeong, Kyowon
    Kaulich, Philipp T.
    Jung, Wonhyeuk
    Kim, Jihyung
    Tholey, Andreas
    Kohlbacher, Oliver
    PROTEOMICS, 2024, 24 (3-4)
  • [2] Advances in top-down proteomics for disease biomarker discovery
    Calligaris, David
    Villard, Claude
    Lafitte, Daniel
    JOURNAL OF PROTEOMICS, 2011, 74 (07) : 920 - 934
  • [4] Top-down proteomics
    Roberts, David S.
    Loo, Joseph A.
    Tsybin, Yury O.
    Liu, Xiaowen
    Wu, Si
    Chamot-Rooke, Julia
    Agar, Jeffrey N.
    Pasa-Tolic, Ljiljana
    Smith, Lloyd M.
    Ge, Ying
    NATURE REVIEWS METHODS PRIMERS, 2024, 4 (01):
  • [5] Top-down proteomics
    Kelleher, NL
    ANALYTICAL CHEMISTRY, 2004, 76 (11) : 196A - 203A
  • [6] Discovery top-down proteomics in symbiotic soybean root nodules
    Zhou, Mowei
    Fulcher, James M.
    Zemaitis, Kevin J.
    Degnan, David J.
    Liao, Yen-Chen
    Velickovic, Marija
    Velickovic, Dusan
    Bramer, Lisa M.
    Kew, William R.
    Stacey, Gary
    Pasa-Tolic, Ljiljana
    FRONTIERS IN ANALYTICAL SCIENCE, 2022, 2
  • [7] Advancements in Top-Down Proteomics
    Zhou, Hu
    Ning, Zhibing
    Starr, Amanda E.
    Abu-Farha, Mohamed
    Figeys, Daniel
    ANALYTICAL CHEMISTRY, 2012, 84 (02) : 720 - 734
  • [8] Tearing the Top Off 'Top-Down' Proteomics
    Perkel, Jeffrey M.
    BIOTECHNIQUES, 2012, 53 (02) : 75 - +
  • [9] Editorial: Breakthroughs in top-down proteomics
    Penque, Deborah
    Marcus, Katrin
    Torres, Vukosaca Milic
    JOURNAL OF PROTEOMICS, 2018, 175 : 1 - 2
  • [10] An FDR metric for top-down proteomics
    Doerr, Allison
    NATURE METHODS, 2019, 16 (03) : 218 - 218