Assessment of maximum likelihood PCA missing data imputation

被引:15
|
作者
Folch-Fortuny, Abel [1 ]
Arteaga, Francisco [2 ]
Ferrer, Alberto [1 ]
机构
[1] Univ Politecn Valencia, Dept Estadist & Invest Operat Aplicadas & Calidad, Multivariate Stat Engn GIEM, Camino Vera S-N, E-46022 Valencia, Spain
[2] Univ Catolica Valencia San Vicente Martir, Dept Biostat & Invest, C Quevedo 2, Valencia 46001, Spain
关键词
maximum likelihood principal component analysis; missing data; regression-based methods; PCA model building; trimmed scores regression; PRINCIPAL COMPONENT ANALYSIS; CURVE RESOLUTION;
D O I
10.1002/cem.2804
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Maximum likelihood principal component analysis (MLPCA) was originally proposed to incorporate measurement error variance information in principal component analysis (PCA) models. MLPCA can be used to fit PCA models in the presence of missing data, simply by assigning very large variances to the non-measured values. An assessment of maximum likelihood missing data imputation is performed in this paper, analysing the algorithm of MLPCA and adapting several methods for PCA model building with missing data to its maximum likelihood version. In this way, known data regression (KDR), KDR with principal component regression (PCR), KDR with partial least squares regression (PLS) and trimmed scores regression (TSR) methods are implemented within the MLPCA method to work as different imputation steps. Six data sets are analysed using several percentages of missing data, comparing the performance of the original algorithm, and its adapted regression-based methods, with other state-of-the-art methods. Copyright (c) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:386 / 393
页数:8
相关论文
共 50 条
  • [41] Missing Data and Multiple Imputation
    Cummings, Peter
    [J]. JAMA PEDIATRICS, 2013, 167 (07) : 656 - 661
  • [42] Missing Data Imputation: A Survey
    Kelkar, Bhagyashri Abhay
    [J]. INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
  • [43] Marginal maximum likelihood estimation of conditional autoregressive models with missing data
    Suesse, Thomas
    Zammit-Mangion, Andrew
    [J]. STAT, 2019, 8 (01):
  • [44] Missing data, imputation, and endogeneity
    McDonough, Ian K.
    Millimet, Daniel L.
    [J]. JOURNAL OF ECONOMETRICS, 2017, 199 (02) : 141 - 155
  • [45] Imputation of Missing Healthcare Data
    Chowdhury, Mohaimanul Hoque
    Islam, Muhammad Kamrul
    Khan, Shahidul Islam
    [J]. 2017 20TH INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2017,
  • [46] BAYESIAN IMPUTATION FOR MISSING DATA
    Nads, Azman A.
    Polestico, Daisy Lou L.
    [J]. ADVANCES AND APPLICATIONS IN STATISTICS, 2022, 79 : 83 - 104
  • [47] Multiple imputation for missing data
    Patrician, PA
    [J]. RESEARCH IN NURSING & HEALTH, 2002, 25 (01) : 76 - 84
  • [48] Analyzing data sets with missing data: An empirical evaluation of imputation methods and likelihood-based methods
    Myrtveit, I
    Stensrud, E
    Olsson, UH
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2001, 27 (11) : 999 - 1013
  • [49] Imputation of missing data in surveys
    Rässler, S
    [J]. JAHRBUCHER FUR NATIONALOKONOMIE UND STATISTIK, 2000, 220 (01): : 64 - 94
  • [50] Multiple imputation of missing data
    Lydersen, Stian
    [J]. TIDSSKRIFT FOR DEN NORSKE LAEGEFORENING, 2022, 142 (02) : 151 - 151