A two-stage density clustering algorithm

被引:9
|
作者
Wang, Min [1 ]
Zhang, Ying-Yi [1 ]
Min, Fan [2 ,3 ]
Deng, Li-Ping [4 ]
Gao, Lei [2 ]
机构
[1] Southwest Petr Univ, Sch Elect Engn & Informat, Chengdu 610500, Peoples R China
[2] Southwest Petr Univ, Sch Comp Sci, Chengdu 610500, Peoples R China
[3] Southwest Petr Univ, Inst Artificial Intelligence, Chengdu 610500, Peoples R China
[4] China West Normal Univ, Sch Comp Sci & Technol, Nanchong 637002, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Density peak; Efficiency;
D O I
10.1007/s00500-020-05028-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering by fast search and find of density peaks (CFDP) is a popular density-based algorithm. However, it is criticized because it is inefficient and applicable only to some types of data, and requires the manual setting of the key parameter. In this paper, we propose the two-stage density clustering algorithm, which takes advantage of granular computing to address the aforementioned issues. The new algorithm is highly efficient, adaptive to various types of data, and requires minimal parameter setting. The first stage uses the two-round-means algorithm to obtain root n small blocks, where n is the number of instances. This stage decreases the data size directly from n to root n. The second stage constructs the master tree and obtains the final blocks. This stage borrows the structure of CFDP, while the cutoff distance parameter is not required. The time complexity of the algorithm is O(mn(3/2)), which is lower than O(mn(2)) for CFDP. We report the results of some experiments performed on 21 datasets from various domains to compare a new clustering algorithm with some state-of-the-art clustering algorithms. The results demonstrated that the new algorithm is adaptive to different types of datasets. It is two or more orders of magnitude faster than CFDP.
引用
收藏
页码:17797 / 17819
页数:23
相关论文
共 50 条
  • [1] A two-stage density clustering algorithm
    Min Wang
    Ying-Yi Zhang
    Fan Min
    Li-Ping Deng
    Lei Gao
    [J]. Soft Computing, 2020, 24 : 17797 - 17819
  • [2] On a two-stage progressive clustering algorithm with graph-augmented density peak clustering
    Niu, Xinzheng
    Zheng, Yunhong
    Liu, Wuji
    Wu, Chase Q.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 108
  • [3] A Two-Stage Clustering Algorithm based on Improved K-means and Density Peak Clustering
    Xiao, Na
    Zhou, Xu
    Huang, Xin
    Yang, Zhibang
    [J]. 2019 10TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK 2019), 2019, : 296 - 301
  • [4] CHRONICLE: A Two-Stage Density-Based Clustering Algorithm for Dynamic Networks
    Kim, Min-Soo
    Han, Jiawei
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2009, 5808 : 152 - 167
  • [5] A two-stage genetic algorithm for automatic clustering
    He, Hong
    Tan, Yonghong
    [J]. NEUROCOMPUTING, 2012, 81 : 49 - 59
  • [6] A method of two-stage clustering learning based on improved DBSCAN and density peak algorithm
    Li, Mingyang
    Bi, Xinhua
    Wang, Limin
    Han, Xuming
    [J]. COMPUTER COMMUNICATIONS, 2021, 167 : 75 - 84
  • [7] Two-Stage Clustering with k-Means Algorithm
    Salman, Raied
    Kecman, Vojislav
    Li, Qi
    Strack, Robert
    Test, Erick
    [J]. RECENT TRENDS IN WIRELESS AND MOBILE NETWORKS, 2011, 162 : 110 - 122
  • [8] Study on fuzzy clustering based on two-stage algorithm
    Huang, Li-min
    Wu, Xiao-jun
    Wang, Shi-tong
    [J]. Huadong Chuanbo Gongye Xueyuan Xuebao/Journal of East China Shipbuilding Institute, 2000, 14 (02): : 74 - 77
  • [9] Two-stage clustering algorithm based on evolution and propagation patterns
    Peng Li
    Haibin Xie
    [J]. Applied Intelligence, 2022, 52 : 11555 - 11568
  • [10] Two-stage clustering algorithm based on evolution and propagation patterns
    Li, Peng
    Xie, Haibin
    [J]. APPLIED INTELLIGENCE, 2022, 52 (10) : 11555 - 11568