Missing-data theory in the context of exploratory data analysis

被引:23
|
作者
Camacho, Jose [1 ]
机构
[1] Univ Granada, Dept Teoria Senal Telemat & Comunicac, E-18071 Granada, Spain
关键词
Exploratory data analysis; Missing data; Data understanding; Latent structures; Correlation matrix; Rotation; Variable selection; PRINCIPAL COMPONENTS; FRAMEWORK; PLS;
D O I
10.1016/j.chemolab.2010.04.017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a new method for exploratory analysis and the interpretation of latent structures. The approach is named missing-data methods for exploratory data analysis (MEDA). The MEDA approach can be applied in combination with several models, including Principal Components Analysis (PCA), Factor Analysis (FA) and Partial Least Squares (PLS). It can be seen as a substitute of rotation methods with better properties associated: it is more accurate than rotation methods in the detection of relationships between pairs of variables, it is robust to the overestimation of the number of PCs and it does not depend on the normalization of the loadings. MEDA is useful to infer the structure in the data and also to interpret the contribution of each latent variable. The interpretation of PLS models with MEDA, including variables selection, may be specially valuable for the chemometrics community. The use of MEDA with PCA and PLS models is demonstrated with several simulated and real examples. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:8 / 18
页数:11
相关论文
共 50 条
  • [41] Model Selection Criteria for Missing-Data Problems Using the EM Algorithm
    Ibrahim, Joseph G.
    Zhu, Hongtu
    Tang, Niansheng
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) : 1648 - 1658
  • [42] A primer on the use of modern missing-data methods in psychosomatic medicine research
    Enders, Craig K.
    [J]. PSYCHOSOMATIC MEDICINE, 2006, 68 (03) : 427 - 436
  • [43] Handling missing values in exploratory multivariate data analysis methods
    Josse, Julie
    Husson, Francois
    [J]. JOURNAL OF THE SFDS, 2012, 153 (02): : 79 - 99
  • [44] Missing data in the forensic context
    Kadane, JB
    Terrin, N
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1997, 160 : 351 - 357
  • [45] A unified approach to exploratory factor analysis with missing data, nonnormal data, and in the presence of outliers
    Ke-Hai Yuan
    Linda L. Marshall
    Peter M. Bentler
    [J]. Psychometrika, 2002, 67 : 95 - 121
  • [46] Unified approach to exploratory factor analysis with missing data, nonnormal data, and in the presence of outliers
    Yuan, KH
    Marshall, LL
    Bentler, PM
    [J]. PSYCHOMETRIKA, 2002, 67 (01) : 95 - 121
  • [47] COMBINING MISSING-DATA RECONSTRUCTION AND UNCERTAINTY DECODING FOR ROBUST SPEECH RECOGNITION
    Gonzalez, Jose A.
    Peinado, Antonio M.
    Gomez, Angel M.
    Ma, Ning
    Barker, Jon
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4693 - 4696
  • [48] Conditions for Ignoring the Missing-Data Mechanism in Likelihood Inferences for Parameter Subsets
    Little, Roderick J.
    Rubin, Donald B.
    Zangeneh, Sahar Z.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) : 314 - 320
  • [49] The MIDAS Touch: Accurate and Scalable Missing-Data Imputation with Deep Learning
    Lall, Ranjit
    Robinson, Thomas
    [J]. POLITICAL ANALYSIS, 2022, 30 (02): : 179 - 196
  • [50] MIGHT: Statistical Methodology for Missing-Data Imputation in Food Composition Databases
    Ispirova, Gordana
    Eftimov, Tome
    Korosec, Peter
    Seljak, Barbara Korousic
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (19):