Assessment of maximum likelihood PCA missing data imputation

被引：15

作者：

Folch-Fortuny, Abel ^{[1
]}

Arteaga, Francisco ^{[2
]}

Ferrer, Alberto ^{[1
]}

机构：

[1] Univ Politecn Valencia, Dept Estadist & Invest Operat Aplicadas & Calidad, Multivariate Stat Engn GIEM, Camino Vera S-N, E-46022 Valencia, Spain

[2] Univ Catolica Valencia San Vicente Martir, Dept Biostat & Invest, C Quevedo 2, Valencia 46001, Spain

来源：

JOURNAL OF CHEMOMETRICS | 2016年 / 30卷 / 07期

关键词：

maximum likelihood principal component analysis; missing data; regression-based methods; PCA model building; trimmed scores regression; PRINCIPAL COMPONENT ANALYSIS; CURVE RESOLUTION;

D O I：

10.1002/cem.2804

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Maximum likelihood principal component analysis (MLPCA) was originally proposed to incorporate measurement error variance information in principal component analysis (PCA) models. MLPCA can be used to fit PCA models in the presence of missing data, simply by assigning very large variances to the non-measured values. An assessment of maximum likelihood missing data imputation is performed in this paper, analysing the algorithm of MLPCA and adapting several methods for PCA model building with missing data to its maximum likelihood version. In this way, known data regression (KDR), KDR with principal component regression (PCR), KDR with partial least squares regression (PLS) and trimmed scores regression (TSR) methods are implemented within the MLPCA method to work as different imputation steps. Six data sets are analysed using several percentages of missing data, comparing the performance of the original algorithm, and its adapted regression-based methods, with other state-of-the-art methods. Copyright (c) 2016 John Wiley & Sons, Ltd.

引用

页码：386 / 393

页数：8

共 50 条

[21] IMPUTATION OF MISSING DATA
Lunt, M.
[J]. ANNALS OF THE RHEUMATIC DISEASES, 2014, 73 : 49 - 49
[22] Empirical likelihood-based inference under imputation for missing response data
Wang, QH
Rao, JNK
[J]. ANNALS OF STATISTICS, 2002, 30 (03): : 896 - 924
[23] Marginal maximum likelihood estimation of SAR models with missing data
Suesse, Thomas
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 120 : 98 - 110
[24] Maximum likelihood estimation for dynamic factor models with missing data
Jungbacker, B.
Koopman, S. J.
van der Wel, M.
[J]. JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2011, 35 (08): : 1358 - 1368
[25] A Primer on Maximum Likelihood Algorithms Available for Use With Missing Data
Enders, Craig K.
[J]. STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2001, 8 (01) : 128 - 141
[26] Consequences of Model Misspecification for Maximum Likelihood Estimation with Missing Data
Golden, Richard M.
Henley, Steven S.
White, Halbert
Kashner, T. Michael
[J]. ECONOMETRICS, 2019, 7 (03)
[27] Maximum-likelihood registration of range images with missing data
Sharp, Gregory C.
Lee, Sang W.
Wehe, David K.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (01) : 120 - 130
[28] MAXIMUM-LIKELIHOOD-ESTIMATION AND LIKELIHOOD RATIO TEST FOR SQUARE TABLES WITH MISSING DATA
SHIH, WJ
[J]. STATISTICS IN MEDICINE, 1987, 6 (01) : 91 - 97
[29] Missing data imputation in quality-of-life assessment - Imputation for WHOQOL-BREF
Lin, Ting Hsiang
[J]. PHARMACOECONOMICS, 2006, 24 (09) : 917 - 925
[30] Maximum likelihood estimation of linear SISO models subject to missing output data and missing input data
Wallin, Ragnar
Hansson, Anders
[J]. INTERNATIONAL JOURNAL OF CONTROL, 2014, 87 (11) : 2354 - 2364

← 1 2 3 4 5 →