A Greedy Algorithm for Hierarchical Complete Linkage Clustering

被引:0
|
作者
Althaus, Ernst [1 ]
Hildebrandt, Andreas [1 ]
Hildebrandt, Anna Katharina [2 ]
机构
[1] Johannes Gutenberg Univ Mainz, Inst Informat, D-55122 Mainz, Germany
[2] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
来源
关键词
bioinformatics; algorithm-engineering; clustering; unsupervised machine learning; GENE-EXPRESSION DATA; EFFICIENT ALGORITHM; CONFORMATIONS; DYNAMICS; DOCKING;
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We are interested in the greedy method to compute an hierarchical complete linkage clustering. There are two known methods for this problem, one having a running time of O(n(3)) with a space requirement of O(n) and one having a running time of O(n(2) log n) with a space requirement of circle minus(n(2)), where n is the number of points to be clustered. Both methods are not capable to handle large point sets. In this paper, we give an algorithm with a space requirement of O(n) which is able to cluster one million points in a day on current commodity hardware.
引用
收藏
页码:25 / 34
页数:10
相关论文
共 50 条
  • [41] Scalable Single Linkage Hierarchical Clustering For Big Data
    Havens, Timothy C.
    Bezdek, James C.
    Palaniswami, Marimuthu
    2013 IEEE EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS AND INFORMATION PROCESSING, 2013, : 396 - 401
  • [42] Statistical properties of the single linkage hierarchical clustering estimator
    Zhu, Dekang
    Guralnik, Dan P.
    Wang, Xuezhi
    Li, Xiang
    Moran, Bill
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2017, 185 : 15 - 28
  • [44] Hierarchical Linkage Clustering with Distributions of Distances for Large-Scale Record Linkage
    Ventura, Samuel L.
    Nugent, Rebecca
    PRIVACY IN STATISTICAL DATABASES, PSD 2014, 2014, 8744 : 283 - 298
  • [45] ON EXPONENTIALLY CONSISTENCY OF LINKAGE-BASED HIERARCHICAL CLUSTERING ALGORITHM USING KOLMOGROV-SMIRNOV DISTANCE
    Wang, Tiexing
    Liu, Yang
    Chen, Biao
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3997 - 4001
  • [46] Optimal algorithms for complete linkage clustering in d dimensions
    Krznaric, D
    Levcopoulos, C
    THEORETICAL COMPUTER SCIENCE, 2002, 286 (01) : 139 - 149
  • [47] Hierarchical link clustering algorithm in networks
    Bodlaj, Jernej
    Batagelj, Vladimir
    PHYSICAL REVIEW E, 2015, 91 (06)
  • [48] Hierarchical clustering algorithm based on granularity
    Liang, Jiuzhen
    Li, Guangbin
    GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 429 - 432
  • [49] An adaptive parallel hierarchical clustering algorithm
    Li, Zhaopeng
    Li, Kenli
    Xiao, Degui
    Yang, Lei
    HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2007, 4782 : 97 - 107
  • [50] A TRANSFER ALGORITHM FOR HIERARCHICAL-CLUSTERING
    SCHADER, M
    MATHEMATICAL SOCIAL SCIENCES, 1982, 2 (02) : 189 - 197