A Comparative Study of Divisive and Agglomerative Hierarchical Clustering Algorithms

被引:73
|
作者
Roux, Maurice [1 ]
机构
[1] Aix Marseille Univ, Marseille, France
关键词
Hierarchical clustering; Dissimilarity data; Splitting procedures; Evaluation of hierarchy; Dendrogram; Ultrametrics; VALIDATION;
D O I
10.1007/s00357-018-9259-9
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
A general scheme for divisive hierarchical clustering algorithms is proposed. It is made of three main steps: first a splitting procedure for the subdivision of clusters into two subclusters, second a local evaluation of the bipartitions resulting from the tentative splits and, third, a formula for determining the node levels of the resulting dendrogram. A set of 12 such algorithms is presented and compared to their agglomerative counterpart (when available). These algorithms are evaluated using the Goodman-Kruskal correlation coefficient. As a global criterion it is an internal goodness-of-fit measure based on the set order induced by the hierarchy compared to the order associated with the given dissimilarities. Applied to a hundred random data tables and to three real life examples, these comparisons are in favor of methods which are based on unusual ratio-type formulas to evaluate the intermediate bipartitions, namely the Silhouette formula, the Dunn's formula and the Mollineda et al. formula. These formulas take into account both the within cluster and the between cluster mean dissimilarities. Their use in divisive algorithms performs very well and slightly better than in their agglomerative counterpart.
引用
收藏
页码:345 / 366
页数:22
相关论文
共 50 条
  • [1] A Comparative Study of Divisive and Agglomerative Hierarchical Clustering Algorithms
    Maurice Roux
    [J]. Journal of Classification, 2018, 35 : 345 - 366
  • [2] Agglomerative and divisive hierarchical Bayesian clustering
    Burghardt, Elliot
    Sewell, Daniel
    Cavanaugh, Joseph
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 176
  • [3] Hierarchical subtrees agglomerative clustering algorithms
    Beijing Municipal Key Laboratory of Multimedia and Intelligent Software Technology, College of Computer Science and Technology, Beijing University of Technology, Beijing 100022, China
    [J]. Beijing Gongye Daxue Xuebao J. Beijing Univ. Technol., 2006, 5 (442-446):
  • [4] Fair Algorithms for Hierarchical Agglomerative Clustering
    Chhabra, Anshuman
    Mohapatra, Prasant
    [J]. 2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 206 - 211
  • [5] Geometric algorithms for agglomerative hierarchical clustering
    Chen, DZ
    Xu, B
    [J]. COMPUTING AND COMBINATORICS, PROCEEDINGS, 2003, 2697 : 30 - 39
  • [6] Constrained Agglomerative Hierarchical Clustering Algorithms with Penalties
    Miyamoto, Sadaaki
    Terami, Akihisa
    [J]. IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 422 - 427
  • [7] A general framework for agglomerative hierarchical clustering algorithms
    Gil-Garcia, Reynaldo J.
    Badia-Contelles, Josd M.
    Pons-Porrata, Aurora
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 569 - 572
  • [8] Asymmetric Agglomerative Hierarchical Clustering Algorithms and Their Evaluations
    Akinobu Takeuchi
    Takayuki Saito
    Hiroshi Yadohisa
    [J]. Journal of Classification, 2007, 24 : 123 - 143
  • [9] Asymmetric agglomerative hierarchical clustering algorithms and their evaluations
    Takeuchi, Akinobu
    Saito, Takayuki
    Yadohisa, Hiroshi
    [J]. JOURNAL OF CLASSIFICATION, 2007, 24 (01) : 123 - 143
  • [10] Hesitant fuzzy agglomerative hierarchical clustering algorithms
    Zhang, Xiaolu
    Xu, Zeshui
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2015, 46 (03) : 562 - 576