Efficient Density-peaks Clustering Algorithms on Static and Dynamic Data in Euclidean Space

Cited by: 2
Authors
Amagata, Daichi [1]
Hara, Takahiro [1]
Affiliations
[1] Osaka Univ, Suita, Osaka, Japan
Keywords
Density-peaks clustering; parallel algorithms; multi-dimensional points; SEARCH
DOI
10.1145/3607873
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Clustering multi-dimensional points is a fundamental task in many fields, and density-based clustering supports many applications because it can discover clusters of arbitrary shapes. This article addresses the problem of Density-Peaks Clustering (DPC) in Euclidean space. DPC already has many applications, but its straightforward implementation incurs O(n^2) time, where n is the number of points, and therefore does not scale to large datasets. To enable DPC on large datasets, we first propose an empirically efficient exact DPC algorithm, Ex-DPC. Although this algorithm is much faster than the straightforward implementation, it still takes O(n^2) time in theory. We hence propose a new exact algorithm, Ex-DPC++, that runs in o(n^2) time. We further accelerate both algorithms by leveraging multi-threading. Moreover, real-world datasets may receive arbitrary updates (point insertions and deletions), so it is important to support efficient cluster updates. To this end, we propose D-DPC for fully dynamic DPC. We conduct extensive experiments using real datasets, and our experimental results demonstrate that our algorithms are efficient and scalable.
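
To make the quadratic cost concrete, the straightforward baseline that the abstract refers to can be sketched as below. This is only an illustration of the standard DPC quantities (a local density counted within a cutoff distance, and the distance to the nearest point of higher density); it is not Ex-DPC, Ex-DPC++, or D-DPC. The function name `naive_dpc`, the parameter name `d_cut`, the tie-breaking by index, and the use of infinity for the densest point are assumptions made for this sketch and are not taken from the article.

```python
import math


def naive_dpc(points, d_cut):
    """Quadratic-time sketch of the two per-point DPC quantities.

    points: list of equal-length coordinate tuples (Euclidean space).
    d_cut:  cutoff distance (hypothetical parameter name).
    Returns (rho, delta, parent), where parent[i] is the index of the
    nearest point with higher density, or -1 for the densest point.
    """
    n = len(points)

    # Local density: number of other points strictly within d_cut.
    # Every pair is examined once, hence O(n^2) distance computations.
    rho = [0] * n
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(points[i], points[j]) < d_cut:
                rho[i] += 1
                rho[j] += 1

    # Dependent distance: distance to the nearest point of higher density.
    # Assumption: density ties are broken by index (lower index counts as
    # denser), so exactly one point ends up with no parent.
    delta = [math.inf] * n
    parent = [-1] * n
    for i in range(n):
        for j in range(n):
            if (rho[j], -j) > (rho[i], -i):
                d = math.dist(points[i], points[j])
                if d < delta[i]:
                    delta[i] = d
                    parent[i] = j
    return rho, delta, parent


# Two well-separated groups of points.
pts = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11)]
rho, delta, parent = naive_dpc(pts, d_cut=1.5)
```

In this sketch, points with a large `rho` and a large `delta` would be the candidates for cluster centers, and every other point would follow its `parent` chain to a center. Both loops touch all pairs of points, which is the O(n^2) behaviour that the algorithms in the article are designed to avoid.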
Pages: 27