Addressing varying non-ignorable missing data mechanisms using a penalized EM algorithm: application to quantitative proteomics data

被引:0
|
作者
Ryu, So Young [1 ]
机构
[1] Univ Nevada, Sch Community Hlth Sci, 1664 N Virginia St, Reno, NV 89557 USA
基金
美国国家卫生研究院;
关键词
Mass spectrometry; Proteomics; Protein relative quantitation; Non-ignorable missing data; Varying missing data mechanisms; PROTEIN; ABUNDANCE;
D O I
10.4310/SII.2018.v11.n4.a3
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In multi-laboratory collaborative or large-scale proteomic studies, it is challenging to analyze data properly due to varying non-ignorable missing data mechanisms across experiments. PEMM (Penalized EM algorithm incorporating missing data mechanism) proposed by Chen, Prentice and Wang [1] estimates both the mean and the covariance of protein abundances in the presence of non-ignorable missing data; however, PEMM assumes a common missing mechanism for all experiments. This approach may be adequate when experiments are performed under similar conditions, but it may not work optimally when experiments are conducted in different laboratories or over a long period of time. In this paper, we extend PEMM to appropriately handle varying missing data mechanisms for datasets generated at multiple laboratories. Recognizing that jointly estimating missing mechanisms and parameters of interest is a challenging task, we assume that missing data mechanisms are known, and demonstrate benefits of incorporating multiple missing mechanisms for datasets generated at different laboratories. We call our algorithm PEMvM (Penalized EM algorithm for varying non-ignorable missing mechanisms). Our extension is simple and enjoys all the properties that PEMM offers. When missing data mechanisms differ across experiments, PEMvM performs better than PEMM in terms of accurate mean estimation and data imputation. In this paper, we demonstrate the performance of PEMvM using both simulated and real proteomic data.
引用
收藏
页码:581 / 586
页数:6
相关论文
共 50 条
  • [1] Monte Carlo EM algorithm in logistic linear models involving non-ignorable missing data
    Park, Jeong-Soo
    Qian, Guoqi Q.
    Jun, Yuna
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2008, 197 (01) : 440 - 450
  • [2] Longitudinal data analysis with non-ignorable missing data
    Tseng, Chi-hong
    Elashoff, Robert
    Li, Ning
    Li, Gang
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (01) : 205 - 220
  • [3] A Latent Variable Model with Non-Ignorable Missing Data
    Takahiro Hoshino
    [J]. Behaviormetrika, 2005, 32 (1) : 71 - 93
  • [4] Mediation Analysis in Categorical Variables under Non-Ignorable Missing Data Mechanisms
    Sarah Ayoku
    Haresh Rochani
    Hani Samawi
    Jingjing Yin
    [J]. Journal of Statistical Theory and Practice, 2023, 17
  • [5] Mediation Analysis in Categorical Variables under Non-Ignorable Missing Data Mechanisms
    Ayoku, Sarah
    Rochani, Haresh
    Samawi, Hani
    Yin, Jingjing
    [J]. JOURNAL OF STATISTICAL THEORY AND PRACTICE, 2023, 17 (04)
  • [6] Empirical likelihood method for non-ignorable missing data problems
    Zhong Guan
    Jing Qin
    [J]. Lifetime Data Analysis, 2017, 23 : 113 - 135
  • [7] Modelling non-ignorable missing-data mechanisms with item response theory models
    Holman, R
    Glas, CAW
    [J]. BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2005, 58 : 1 - 17
  • [8] Empirical likelihood method for non-ignorable missing data problems
    Guan, Zhong
    Qin, Jing
    [J]. LIFETIME DATA ANALYSIS, 2017, 23 (01) : 113 - 135
  • [9] A Bayesian algorithm based on auxiliary variables for estimating GRM with non-ignorable missing data
    Zhang, Jiwei
    Zhang, Zhaoyuan
    Tao, Jian
    [J]. COMPUTATIONAL STATISTICS, 2021, 36 (04) : 2643 - 2669
  • [10] Scoring and interpretation of daily diary data in the presence of non-ignorable missing data
    Griffiths, Philip
    Floden, Lysbeth L.
    Hudgens, Stacie
    [J]. QUALITY OF LIFE RESEARCH, 2017, 26 (01) : 70 - 70