Evaluating Proteomics Imputation Methods with Improved Criteria

被引:4
|
作者
Harris, Lincoln [1 ]
Fondrie, William E. [2 ]
Oh, Sewoong [3 ]
Noble, William S. [1 ,3 ]
机构
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[2] Talus Biosci, Seattle, WA 98112 USA
[3] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
关键词
quantitative mass spectrometry; proteomics; imputation; machine learning; statistics; differential expression; lower limit of quantification; MISSING VALUE IMPUTATION; MASS SPECTROMETRY; R-PACKAGE; SETS;
D O I
10.1021/acs.jproteome.3c00205
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Quantitative measurements produced by tandem mass spectrometry proteomics experiments typically contain a large proportion of missing values. Missing values hinder reproducibility, reduce statistical power, and make it difficult to compare across samples or experiments. Although many methods exist for imputing missing values, in practice, the most commonly used methods are among the worst performing. Furthermore, previous benchmarking studies have focused on relatively simple measurements of error such as the mean-squared error between imputed and held-out values. Here we evaluate the performance of commonly used imputation methods using three practical, "downstream-centric" criteria. These criteria measure the ability to identify differentially expressed peptides, generate new quantitative peptides, and improve the peptide lower limit of quantification. Our evaluation comprises several experiment types and acquisition strategies, including data-dependent and data-independent acquisition. We find that imputation does not necessarily improve the ability to identify differentially expressed peptides but that it can identify new quantitative peptides and improve the peptide lower limit of quantification. We find that MissForest is generally the best performing method per our downstream-centric criteria. We also argue that existing imputation methods do not properly account for the variance of peptide quantifications and highlight the need for methods that do.
引用
收藏
页码:3427 / 3438
页数:12
相关论文
共 50 条
  • [21] An in-depth benchmark framework for evaluating single cell RNA-seq dropout imputation methods and the development of an improved algorithm afMF
    Huang, Jinghan
    Chow, Anson C. M.
    Tang, Nelson L. S.
    Yam, Sheung Chi Phillip
    CLINICAL AND TRANSLATIONAL MEDICINE, 2025, 15 (04):
  • [22] Evaluating imputation methods for single-cell RNA-seq data
    Yi Cheng
    Xiuli Ma
    Lang Yuan
    Zhaoguo Sun
    Pingzhang Wang
    BMC Bioinformatics, 24
  • [23] A Review of Criteria and Methods for Evaluating the Probiotic Potential of Microorganisms
    Byakika, Stellah
    Mukisa, Ivan Muzira
    Byaruhanga, Yusuf Byenkya
    Muyanja, Charles
    FOOD REVIEWS INTERNATIONAL, 2019, 35 (05) : 427 - 466
  • [24] Criteria for evaluating research: the unique adequacy requirement of methods
    Rooke, John Alfred
    Kagioglou, Mike
    CONSTRUCTION MANAGEMENT AND ECONOMICS, 2007, 25 (09) : 979 - 987
  • [25] CRITERIA AND METHODS FOR PERFORMING AND EVALUATING SOLAR WEATHER STUDIES
    FLUECK, JA
    BROWN, TJ
    JOURNAL OF CLIMATE, 1993, 6 (02) : 373 - 385
  • [26] A Decade of Evaluating Europeana - Constructs, Contexts, Methods & Criteria
    Petras, Vivien
    Stiller, Juliane
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES (TPDL 2017), 2017, 10450 : 233 - 245
  • [27] Criteria for evaluating group decision-making methods
    Peniwati, Kirti
    MATHEMATICAL AND COMPUTER MODELLING, 2007, 46 (7-8) : 935 - 947
  • [28] New Criteria for Evaluating Methods of Identifying Hot Spots
    Cheng, Wen
    Washington, Simon
    TRANSPORTATION RESEARCH RECORD, 2008, (2083) : 76 - 85
  • [29] IMPROVED METHODS FOR EVALUATING STARCH FOR SPECIFIC USES
    CAMPBELL, HA
    HOLLIS, F
    MACALLISTER, RV
    FOOD TECHNOLOGY, 1950, 4 (12) : 492 - 496
  • [30] IMPROVED METHODS OF EVALUATING MODERN GEAR LUBRICANTS
    POTTER, RI
    GAGLIARD.JC
    SKUTNICK, WJ
    WALTHALL, OK
    SAE TRANSACTIONS, 1969, 78 : 90 - &