Using principal component analysis and correspondence analysis for estimation in latent variable models

被引：10

作者：

Lynn, HS ^{[1
]}

McCulloch, CE

机构：

[1] Rho Inc, Chapel Hill, NC 27514 USA

[2] Cornell Univ, Biometr Unit, Dept Stat Sci, Ithaca, NY 14853 USA

来源：

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION | 2000年 / 95卷 / 450期

关键词：

consistency; correspondence analysis; incidental parameters; principal component analysis;

D O I：

10.2307/2669399

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Correspondence analysis (CA) and principal component analysis (PCA) are often used to describe multivariate data. In certain applications they have been used for estimation in latent variable models. The theoretical basis for such inference is assessed in generalized linear models where the linear predictor equals alpha(j) + x(i)beta(j) or a(j) - b(j) (x(i) - u(j))(2), (i = 1, ..., n; j = 1, ..., m), and x(i) is treated as a latent fixed effect. The PCA and CA eigenvectors/column scores are evaluated as estimators of beta(j) and u(j) and as estimators of u(j). With m fixed and n up arrow infinity, consistent estimators cannot be obtained due to the incidental parameters problem unless sufficient "moment" conditions are imposed on x(i). PCA is equivalent to maximum likelihood estimation for the linear Gaussian model and gives a consistent estimator of beta(j) (up to a scale change) when the second sample moment of x(i) is positive and finite in the limit. It is inconsistent for Poisson and Bernoulli distributions, but when b(j) is constant, its first and/or second eigenvectors can consistently estimate u(j) (up to a location and scale change) for the quadratic Gaussian model. In contrast, the CA estimator is always inconsistent. For finite samples, however, the CA column scores often have high correlations with the u(j)'s, especially when the response curves are spread out relative to one another. The correlations obtained from PCA are usually weaker, although the second PCA eigenvector can sometimes do much better than the first eigenvector, and for incidence data with tightly clustered response curves its performance is comparable to that of CA. For small sample sizes, PCA and particularly CA are competitive alternatives to maximum likelihood and may be preferred because of their computational ease.

引用

页码：561 / 572

页数：12

共 50 条

[1] Probabilistic non-linear principal component analysis with Gaussian process latent variable models
Lawrence, N
JOURNAL OF MACHINE LEARNING RESEARCH, 2005, 6 : 1783 - 1816
[2] On detrending in correspondence analysis and principal component analysis
Karadzic, B
ECOSCIENCE, 1999, 6 (01): : 110 - 116
[3] Brain Shape Correspondence Analysis Using Variational Mixtures for Gaussian Process Latent Variable Models
Minoli, Juan P., V
Orozco, Alvaro A.
Porras-Hurtado, Gloria L.
Garcia, Hernan F.
ARTIFICIAL INTELLIGENCE IN NEUROSCIENCE: AFFECTIVE ANALYSIS AND HEALTH APPLICATIONS, PT I, 2022, 13258 : 547 - 556
[4] Estimation of dynamic models on the factors of marginal principal component analysis
Mayo, Nathanael
STATISTICS, 2011, 45 (01) : 101 - 120
[5] Degrees of freedom estimation in Principal Component Analysis and Consensus Principal Component Analysis
Hassani, Sahar
Martens, Harald
Qannari, El Mostafa
Kohler, Achim
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2012, 118 : 246 - 259
[6] Improving Principal Component Analysis using Bayesian estimation
Nounou, MN
Bakshi, BR
Goel, PK
Shen, XT
PROCEEDINGS OF THE 2001 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2001, : 3666 - 3671
[7] ON THE DISTRIBUTION OF THE LATENT VECTORS FOR PRINCIPAL COMPONENT ANALYSIS
SUGIYAMA, T
ANNALS OF MATHEMATICAL STATISTICS, 1965, 36 (06): : 1875 - 1876
[8] NONLINEAR PROBABILISTIC LATENT VARIABLE MODELS FOR GROUPWISE CORRESPONDENCE ANALYSIS IN BRAIN STRUCTURES
Garcia, Hernan F.
Orozco, Alvaro A.
Alvarez, Mauricio A.
2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
[9] Dynamic heart rate estimation using principal component analysis
Yu, Yong-Poh
Raveendran, P.
Lim, Chern-Loon
Kwan, Ban-Hoe
BIOMEDICAL OPTICS EXPRESS, 2015, 6 (11): : 4610 - 4618
[10] Fault detection and estimation using kernel principal component analysis
Kallas, Maya
Mourot, Gilles
Anani, Kwami
Ragot, Jose
Maquin, Didier
IFAC PAPERSONLINE, 2017, 50 (01): : 1025 - 1030

← 1 2 3 4 5 →