Efficient Density-peaks Clustering Algorithms on Static and Dynamic Data in Euclidean Space

被引:2
|
作者
Amagata, Daichi [1 ]
Hara, Takahiro [1 ]
机构
[1] Osaka Univ, Suita, Osaka, Japan
关键词
Density-peaks clustering; parallel algorithms; multi-dimensional points; SEARCH;
D O I
10.1145/3607873
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering multi-dimensional points is a fundamental task in many fields, and density-based clustering supports many applications because it can discover clusters of arbitrary shapes. This article addresses the problem of Density-Peaks Clustering (DPC) in Euclidean space. DPC already has many applications, but its straightforward implementation incurs O(n(2)) time, where n is the number of points, thereby does not scale to large datasets. To enable DPC on large datasets, we first propose empirically efficient exact DPC algorithm, Ex-DPC. Although this algorithm is much faster than the straightforward implementation, it still suffers from O(n(2)) time theoretically. We hence propose a new exact algorithm, Ex-DPC++, that runs in o(n(2)) time. We accelerate their efficiencies by leveraging multi-threading. Moreover, real-world datasets may have arbitrary updates (point insertions and deletions). It is hence important to support efficient cluster updates. To this end, we propose D-DPC for fully dynamic DPC. We conduct extensive experiments using real datasets, and our experimental results demonstrate that our algorithms are efficient and scalable.
引用
收藏
页数:27
相关论文
共 50 条
  • [21] Density peaks clustering based on geodetic distance and dynamic neighbourhood
    Lv, Li
    Wang, Jiayuan
    Wu, Runxiu
    Wang, Hui
    Lee, Ivan
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2021, 17 (01) : 24 - 33
  • [22] Combining density peaks clustering and gravitational search method to enhance data clustering
    Sun, Liping
    Tao, Tao
    Zheng, Xiaoyao
    Bao, Shuting
    Luo, Yonglong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 85 : 865 - 873
  • [23] Clustering by Fast Search and Find of Density Peaks with Data Field
    WANG Shuliang
    WANG Dakui
    LI Caoyuan
    LI Yan
    DING Gangyi
    ChineseJournalofElectronics, 2016, 25 (03) : 397 - 402
  • [24] Clustering by Fast Search and Find of Density Peaks with Data Field
    Wang Shuliang
    Wang Dakui
    Li Caoyuan
    Li Yan
    Ding Gangyi
    CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (03) : 397 - 402
  • [25] Clustering Mixed Data by Fast Search and Find of Density Peaks
    Liu, Shihua
    Zhou, Bingzhong
    Huang, Decai
    Shen, Liangzhong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
  • [26] Study on Density Peaks Clustering Algorithm of Vehicle Trajectory Data
    Jiang H.
    Lu B.
    Li A.
    Qiche Gongcheng/Automotive Engineering, 2023, 45 (07): : 1153 - 1162
  • [27] An Efficient Grid-based Clustering Method by Finding Density Peaks
    Wu, Bo
    Wilamowski, B. M.
    PROCEEDINGS OF THE IECON 2016 - 42ND ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2016, : 837 - 842
  • [28] Efficient Clustering Method Based on Density Peaks With Symmetric Neighborhood Relationship
    Wu, Chunrong
    Lee, Jia
    Isokawa, Teijiro
    Yao, Jun
    Xia, Yunni
    IEEE ACCESS, 2019, 7 : 60684 - 60696
  • [29] Dynamic graph-based label propagation for density peaks clustering
    Seyedi, Seyed Amjad
    Lotfi, Abdulrahman
    Moradi, Parham
    Qader, Nooruldeen Nasih
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 115 : 314 - 328
  • [30] STATIC AND DYNAMIC ALGORITHMS FOR K-POINT CLUSTERING PROBLEMS
    DATTA, A
    LENHOF, HP
    SCHWARZ, C
    SMID, M
    JOURNAL OF ALGORITHMS, 1995, 19 (03) : 474 - 503