Missing-data theory in the context of exploratory data analysis

Cited by: 23
Authors
Camacho, Jose [1]
Affiliation
[1] Univ Granada, Dept Teoria Senal Telemat & Comunicac, E-18071 Granada, Spain
Keywords
Exploratory data analysis; Missing data; Data understanding; Latent structures; Correlation matrix; Rotation; Variable selection; Principal components; Framework; PLS
DOI
10.1016/j.chemolab.2010.04.017
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes a new method for exploratory analysis and the interpretation of latent structures. The approach is named missing-data methods for exploratory data analysis (MEDA). MEDA can be applied in combination with several models, including Principal Components Analysis (PCA), Factor Analysis (FA) and Partial Least Squares (PLS). It can be seen as a substitute for rotation methods, with better properties: it is more accurate than rotation methods in detecting relationships between pairs of variables, it is robust to overestimation of the number of principal components (PCs), and it does not depend on the normalization of the loadings. MEDA is useful for inferring the structure in the data and for interpreting the contribution of each latent variable. The interpretation of PLS models with MEDA, including variable selection, may be especially valuable for the chemometrics community. The use of MEDA with PCA and PLS models is demonstrated with several simulated and real examples. (C) 2010 Elsevier B.V. All rights reserved.
Pages: 8 - 18
Page count: 11
Related papers
50 in total
  • [1] A Comparison of Missing-Data Imputation Techniques in Exploratory Factor Analysis
    Xiao, Canhua
    Bruner, Deborah W.
    Dai, Tian
    Guo, Ying
    Hanlon, Alexandra
    [J]. JOURNAL OF NURSING MEASUREMENT, 2019, 27 (02) : 313 - 334
  • [2] Planned missing-data designs in analysis of change
    Graham, JW
    Taylor, BJ
    Cumsille, PE
    [J]. NEW METHODS FOR THE ANALYSIS OF CHANGE, 2001 : 335 - 353
  • [3] Missing-data model of vowel identification
    de Cheveigné, A
    Kawahara, H
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (06) : 3497 - 3508
  • [4] Missing-Data Nonparametric Coherency Estimation
    Haley, C.
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1704 - 1708
  • [5] MISSING-DATA ADJUSTMENTS IN LARGE SURVEYS
    LITTLE, RJA
    [J]. JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 1988, 6 (03) : 287 - 296
  • [6] Analysis of NMAR missing data without specifying missing-data mechanisms in a linear latent variate model
    Kano, Yutaka
    Takai, Keiji
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2011, 102 (09) : 1241 - 1255
  • [7] Partial and latent ignorability in missing-data problems
    Harel, Ofer
    Schafer, Joseph L.
    [J]. BIOMETRIKA, 2009, 96 (01) : 37 - 50
  • [8] Correcting missing-data bias in historical demography
    Jonker, M. A.
    van der Vaart, A. W.
    [J]. POPULATION STUDIES-A JOURNAL OF DEMOGRAPHY, 2007, 61 (01) : 99 - 113
  • [9] Statistical analysis with missing exposure data measured by proxy respondents: a misclassification problem within a missing-data problem
    Shardell, Michelle
    Hicks, Gregory E.
    [J]. STATISTICS IN MEDICINE, 2014, 33 (25) : 4437 - 4452
  • [10] Parameter Estimation Algorithms for Missing-Data Systems
    Ding, Feng
    Ding, Jie
    [J]. 2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009 : 5032 - 5036