A Comparative Study of Divisive and Agglomerative Hierarchical Clustering Algorithms

被引:0
|
作者
Maurice Roux
机构
[1] Faculté des Sciences de St-Jérôme,IMBE (Aix Marseille Université, CNRS, IRD, Univ Avignon)
来源
Journal of Classification | 2018年 / 35卷
关键词
Hierarchical clustering; Dissimilarity data; Splitting procedures; Evaluation of hierarchy; Dendrogram; Ultrametrics;
D O I
暂无
中图分类号
学科分类号
摘要
A general scheme for divisive hierarchical clustering algorithms is proposed. It is made of three main steps: first a splitting procedure for the subdivision of clusters into two subclusters, second a local evaluation of the bipartitions resulting from the tentative splits and, third, a formula for determining the node levels of the resulting dendrogram. A set of 12 such algorithms is presented and compared to their agglomerative counterpart (when available). These algorithms are evaluated using the Goodman-Kruskal correlation coefficient. As a global criterion it is an internal goodness-of-fit measure based on the set order induced by the hierarchy compared to the order associated with the given dissimilarities. Applied to a hundred random data tables and to three real life examples, these comparisons are in favor of methods which are based on unusual ratio-type formulas to evaluate the intermediate bipartitions, namely the Silhouette formula, the Dunn's formula and the Mollineda et al. formula. These formulas take into account both the within cluster and the between cluster mean dissimilarities. Their use in divisive algorithms performs very well and slightly better than in their agglomerative counterpart.
引用
收藏
页码:345 / 366
页数:21
相关论文
共 50 条
  • [21] IMPLEMENTING AGGLOMERATIVE HIERARCHICAL-CLUSTERING ALGORITHMS FOR USE IN DOCUMENT-RETRIEVAL
    VOORHEES, EM
    INFORMATION PROCESSING & MANAGEMENT, 1986, 22 (06) : 465 - 476
  • [23] Avalanche: A Hierarchical, Divisive Clustering Algorithm
    Amalaman, Paul K.
    Eick, Christoph F.
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2015, 2015, 9166 : 296 - 310
  • [24] Agglomerative hierarchical clustering for data with tolerance
    Yasunori, Endo
    Yukihiro, Hamasuna
    Sadaaki, Miyamoto
    GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 404 - 409
  • [25] Order preserving hierarchical agglomerative clustering
    Bakkelund, Daniel
    MACHINE LEARNING, 2022, 111 (05) : 1851 - 1901
  • [26] Hierarchical Agglomerative Clustering with Ordering Constraints
    Zhao, Haifeng
    Qi, ZiJie
    THIRD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING: WKDD 2010, PROCEEDINGS, 2010, : 195 - 199
  • [27] Learning the threshold in hierarchical agglomerative clustering
    Daniels, Kristine
    Giraud-Carrier, Christophe
    ICMLA 2006: 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2006, : 270 - +
  • [28] Refinement Properties in Agglomerative Hierarchical Clustering
    Miyamoto, Sadaaki
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5861 : 259 - 267
  • [29] Divisive hierarchical maximum likelihood clustering
    Alok Sharma
    Yosvany López
    Tatsuhiko Tsunoda
    BMC Bioinformatics, 18
  • [30] Order preserving hierarchical agglomerative clustering
    Daniel Bakkelund
    Machine Learning, 2022, 111 : 1851 - 1901