Automatic extraction of clusters from hierarchical clustering representations

被引:0
|
作者
Sander, J [1 ]
Qin, XJ [1 ]
Lu, ZY [1 ]
Niu, N [1 ]
Kovarsky, A [1 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
关键词
hierarchical clustering; OPTICS; single-link method; dendrogram; reachability-plot;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical clustering algorithms are typically more effective in detecting the true clustering structure of a data set than partitioning algorithms. However, hierarchical clustering algorithms do not actually create clusters, but compute only a hierarchical representation, of the data set. This makes them unsuitable as an automatic pre-processing step for other algorithms that operate on detected clusters. This is true for both dendrograms and reachability plots, which have been proposed as hierarchical clustering representations, and which have different advantages and disadvantages. In this paper we first investigate the relation between dendrograms and reachability plots and introduce methods to convert them into each other showing that they essentially contain the same information. Based on reachability plots, we then introduce a technique that automatically determines the significant clusters in a hierarchical cluster representation. This makes it for the first time possible to use hierarchical clustering as an automatic pre-processing step that requires no user interaction to select clusters from a hierarchical cluster representation.
引用
收藏
页码:75 / 87
页数:13
相关论文
共 50 条
  • [21] Indexes to Find the Optimal Number of Clusters in a Hierarchical Clustering
    David Martin-Fernandez, Jose
    Maria Luna-Romera, Jose
    Pontes, Beatriz
    Riquelme-Santos, Jose C.
    14TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2019), 2020, 950 : 3 - 13
  • [22] A decision criterion for the optimal number of clusters in hierarchical clustering
    Jung, J
    Park, H
    Du, DZ
    Drake, BL
    JOURNAL OF GLOBAL OPTIMIZATION, 2003, 25 (01) : 91 - 111
  • [23] DISTANCES BETWEEN CLUSTERS IN THE AGGLOMERATIVE HIERARCHICAL CLUSTERING OF STRINGS
    Eremic, Zeljko
    Radosav, Dragica
    METALURGIA INTERNATIONAL, 2012, 17 (08): : 67 - 74
  • [24] AUTOMATIC GENERATION OF ONTOLOGIES: A HIERARCHICAL WORD CLUSTERING APPROACH
    Sellah, Smail
    Hilaire, Vincent
    IADIS-INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2018, 13 (02): : 76 - 92
  • [25] Hierarchical Language Identification based on Automatic Language Clustering
    Yin, Bo
    Ambikairajah, Eliathamby
    Chen, Fang
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1217 - 1220
  • [26] Automatic synthesis of synergies for control of reaching - hierarchical clustering
    Jovovic, M
    Jonic, S
    Popovic, D
    MEDICAL ENGINEERING & PHYSICS, 1999, 21 (05) : 329 - 341
  • [27] Automatic synthesis of synergies for control of reaching - Hierarchical clustering
    Jovović, Milan
    Jonić, Slavica
    Popović, Dejan
    Medical Engineering and Physics, 1999, 21 (05): : 329 - 341
  • [28] Automatic hierarchical clustering algorithm for remote sensing data
    Sidorova V.S.
    Pattern Recognition and Image Analysis, 2011, 21 (02) : 328 - 331
  • [29] Using topic keyword clusters for automatic document clustering
    Chang, HC
    Hsu, CC
    THIRD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2005, : 419 - 424
  • [30] Distributed Fuzzy Clustering with Automatic Detection of the Number of Clusters
    Vendramin, L.
    Campello, R. J. G. B.
    Coletta, L. F. S.
    Hruschka, E. R.
    INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2011, 91 : 133 - 140