Graphical Methods for Influential Data Points in Cluster Analysis

被引:0
|
作者
Jang, Dae-Heung [1 ]
Kim, Youngil [2 ]
Anderson-Cook, Christine M. [3 ]
机构
[1] Pukyong Natl Univ, Dept Stat, Busan, South Korea
[2] Chung Ang Univ, Sch Business & Econ, Seoul, South Korea
[3] Los Alamos Natl Lab, Stat Sci Grp, Los Alamos, NM USA
关键词
influence matrix; condensed influence plot; 3-D influence plot; row-wise membership movement plot; column-wise membership movement plot;
D O I
10.1002/qre.1744
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In cluster analysis, many numerical measures to detect which data points are influential have been proposed in the past literature. These numerical measures provide only limited information about which data points are influential but fail to reveal deeper relationships between the observations. They describe an overall pattern but fail to provide details about the mechanism that exists among the influential data points. In this paper, several graphical methods are described for detecting this mechanism. In the process, each data point is decomposed to show the pattern, how it influences other observations and the partitioning in cluster analysis. The approach also allows comparison of different clustering methods and how these options impact the relationship between observations. Copyright (c) 2014 John Wiley & Sons, Ltd.
引用
收藏
页码:231 / 239
页数:9
相关论文
共 50 条
  • [41] Graphical Data Analysis With R
    Wainer, Howard
    Friendly, Michael
    Millan-Martinez, Pere
    JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2015, 40 (06) : 665 - 670
  • [42] GRAPHICAL DATA-ANALYSIS
    WAINER, H
    THISSEN, D
    ANNUAL REVIEW OF PSYCHOLOGY, 1981, 32 : 191 - 241
  • [44] GRAPHICAL ANALYSIS OF FAULTS USING ACCUMULATION POINTS OF STRIAE
    VERGELY, P
    SASSI, W
    CAREYGAILHARDIS, E
    BULLETIN DE LA SOCIETE GEOLOGIQUE DE FRANCE, 1987, 3 (02): : 395 - 402
  • [45] Graphical Methods for the Sensitivity Analysis in Discriminant Analysis
    Jang, Dae-Heung
    Anderson-Cook, Christine M.
    Kim, Youngil
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2015, 22 (05) : 475 - 485
  • [46] Influential Points in Adaptability and Stability Methods Based on Regression Models in Cotton Genotypes
    Nascimento, Moyses
    Teodoro, Paulo Eduardo
    Sant'Anna, Isabela de Castro
    Barroso, Lais Mayara Azevedo
    Nascimento, Ana Carolina Campana
    Azevedo, Camila Ferreira
    Teodoro, Larissa Pereira Ribeiro
    Farias, Francisco Jose Correia
    Almeida, Helaine Claire
    de Carvalho, Luiz Paulo
    AGRONOMY-BASEL, 2021, 11 (11):
  • [47] Quantitative methods of standardization in cluster analysis: finding groups in data
    Nogueira, Andre Luiz
    Munita, Casimiro S.
    JOURNAL OF RADIOANALYTICAL AND NUCLEAR CHEMISTRY, 2020, 325 (03) : 719 - 724
  • [48] A comparison of cluster analysis methods using DNA methylation data
    Siegmund, KD
    Laird, PW
    Laird-Offringa, IA
    BIOINFORMATICS, 2004, 20 (12) : 1896 - 1904
  • [49] Research on Methods and Techniques for IoT Big Data Cluster Analysis
    Bin, Ni
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND COMPUTER AIDED EDUCATION (ICISCAE 2018), 2018, : 184 - 188
  • [50] APPLICATION OF METHODS OF CLUSTER ANALYSIS TO DATA PROCESSING IN PSYCHOLOGICAL STUDIES
    Savchenko, T. N.
    EKSPERIMENTALNAYA PSIKHOLOGIYA, 2010, 3 (02): : 67 - 86