A Comparative Study of Divisive and Agglomerative Hierarchical Clustering Algorithms

Author
Maurice Roux
Affiliation
[1] Faculté des Sciences de St-Jérôme, IMBE (Aix Marseille Université, CNRS, IRD, Univ Avignon)
Source
Journal of Classification | 2018 / Vol. 35
Keywords
Hierarchical clustering; Dissimilarity data; Splitting procedures; Evaluation of hierarchy; Dendrogram; Ultrametrics
DOI
Not available
Abstract
A general scheme for divisive hierarchical clustering algorithms is proposed. It consists of three main steps: first, a splitting procedure that subdivides a cluster into two subclusters; second, a local evaluation of the bipartitions resulting from the tentative splits; and third, a formula for determining the node levels of the resulting dendrogram. A set of 12 such algorithms is presented and compared to their agglomerative counterparts (when available). These algorithms are evaluated using the Goodman-Kruskal correlation coefficient. Used as a global criterion, it is an internal goodness-of-fit measure that compares the order induced by the hierarchy with the order associated with the given dissimilarities. Applied to a hundred random data tables and to three real-life examples, these comparisons favor methods based on unusual ratio-type formulas for evaluating the intermediate bipartitions, namely the Silhouette formula, Dunn's formula and the Mollineda et al. formula. These formulas take into account both the within-cluster and the between-cluster mean dissimilarities. Their use in divisive algorithms performs very well, and slightly better than in their agglomerative counterparts.
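To make the evaluation criterion concrete, here is a minimal Python sketch of a Goodman-Kruskal coefficient between the given dissimilarities and the node levels (ultrametric distances) read from a dendrogram. It assumes the standard gamma definition, (concordant - discordant) / (concordant + discordant) with tied comparisons ignored; the abstract does not state how the paper handles ties, so that choice, the function name `goodman_kruskal_gamma`, and the toy data are all illustrative assumptions rather than the author's implementation.

```python
from itertools import combinations


def goodman_kruskal_gamma(dissimilarities, ultrametric):
    """Goodman-Kruskal gamma between two equal-length lists of pairwise values.

    Each position holds the values for one pair of objects: the given
    dissimilarity and the dendrogram level at which the pair is joined.
    Comparisons tied in either list are ignored (assumed convention).
    """
    if len(dissimilarities) != len(ultrametric):
        raise ValueError("both inputs must describe the same object pairs")

    concordant = 0
    discordant = 0
    # Compare every pair of object pairs.
    for a, b in combinations(range(len(dissimilarities)), 2):
        sign = (dissimilarities[a] - dissimilarities[b]) * (
            ultrametric[a] - ultrametric[b]
        )
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1

    total = concordant + discordant
    if total == 0:
        return float("nan")  # every comparison is tied: gamma is undefined
    return (concordant - discordant) / total


# Toy example (made-up data): 4 objects, pairs ordered as
# (0,1), (0,2), (0,3), (1,2), (1,3), (2,3).
d = [1, 4, 5, 3, 6, 2]
# Hypothetical hierarchy: {0,1} joined at level 1, {2,3} at level 2,
# and the two clusters joined at level 6.
u = [1, 6, 6, 6, 6, 2]
print(goodman_kruskal_gamma(d, u))  # 1.0
```

A value of 1 means the hierarchy never reverses the order of two dissimilarities, and a value of -1 means it reverses every untied comparison. The double loop is quadratic in the number of object pairs, which is adequate for small tables but would need a faster counting scheme for large ones.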
Pages: 345-366
Number of pages: 21