A fast density peaks clustering algorithm with sparse search

被引:48
|
作者
Xu, Xiao [1 ]
Ding, Shifei [1 ,2 ]
Wang, Yanru [1 ]
Wang, Lijuan [1 ]
Jia, Weikuan [3 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
[2] Minstry Educ Peoples Republ China, Mine Digitizat Engn Res Ctr, Xuzhou 221116, Jiangsu, Peoples R China
[3] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
关键词
DPC algorithm; Computational complexity; Sparse search strategy; Fewer distance calculations; Similarity matrix; FIND; SHAPES; NUMBER;
D O I
10.1016/j.ins.2020.11.050
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a large unlabeled set of complex data, how to efficiently and effectively group them into clusters remains a challenging problem. Density peaks clustering (DPC) algorithm is an emerging algorithm, which identifies cluster centers based on a decision graph. Without setting the number of cluster centers, DPC can effectively recognize the clusters. However, the similarity between every two data points must be calculated to construct a decision graph, which results in high computational complexity. To overcome this issue, we propose a fast sparse search density peaks clustering (FSDPC) algorithm to enhance the DPC, which constructs a decision graph with fewer similarity calculations to identify cluster centers quickly. In FSDPC, we design a novel sparse search strategy to measure the similarity between the nearest neighbors of each data points. Therefore, FSDPC can enhance the efficiency of the DPC while maintaining satisfactory results. We also propose a novel random third-party data point method to search the nearest neighbors, which introduces no additional parameters or high computational complexity. The experimental results on synthetic datasets and real-world datasets indicate that the proposed algorithm consistently outperforms the DPC and other state-of-the-art algorithms. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:61 / 83
页数:23
相关论文
共 50 条
  • [1] Sparse learning based on clustering by fast search and find of density peaks
    Li, Pengqing
    Deng, Xuelian
    Zhang, Leyuan
    Gan, Jiangzhang
    Li, Jiaye
    Li, Yonggang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (23) : 33261 - 33277
  • [2] Sparse learning based on clustering by fast search and find of density peaks
    Pengqing Li
    Xuelian Deng
    Leyuan Zhang
    Jiangzhang Gan
    Jiaye Li
    Yonggang Li
    [J]. Multimedia Tools and Applications, 2019, 78 : 33261 - 33277
  • [3] ICFS: An Improved Fast Search and Find of Density Peaks Clustering Algorithm
    Gao, Jing
    Zhao, Liang
    Chen, Zhikui
    Li, Peng
    Xu, Han
    Hu, Yueming
    [J]. 2016 IEEE 14TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 14TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 2ND INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/DATACOM/CYBERSC, 2016, : 537 - 543
  • [4] A Density Peaks Clustering Algorithm With Sparse Search and K-d Tree
    Shan, Yunxiao
    Li, Shu
    Li, Fuxiang
    Cui, Yuxin
    Li, Shuai
    Zhou, Ming
    Li, Xiang
    [J]. IEEE ACCESS, 2022, 10 : 74883 - 74901
  • [5] Clustering by fast search and find of density peaks
    Rodriguez, Alex
    Laio, Alessandro
    [J]. SCIENCE, 2014, 344 (6191) : 1492 - 1496
  • [6] Paralleled fast search and find of density peaks clustering algorithm on GPUs with CUDA
    Li M.
    Huang J.
    Wang J.
    [J]. International Journal of Networked and Distributed Computing, 2016, 4 (3) : 173 - 181
  • [7] A fuzzy mixed data clustering algorithm by fast search and find of density peaks
    Li, Ye
    Chen, Yiyan
    Li, Qun
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 : S199 - S224
  • [8] Paralleled Fast Search and Find of Density Peaks Clustering Algorithm on GPUs with CUDA
    Li, Mi
    Huang, Jie
    Wang, Jingpeng
    [J]. 2016 17TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2016, : 313 - 318
  • [9] A clustering algorithm for fuzzy numbers based on fast search and find of density peaks
    Li, Ye
    Chen, Yiyan
    Li, Qun
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 : S25 - S52
  • [10] Fuzzy clustering by fast search and find of density peaks
    Mehmood, Rashid
    Dawood, Hussain
    Bie, Rongfang
    Ahmad, Haseeb
    [J]. 2015 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION, AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI), 2015, : 258 - 261