A novel density peaks clustering algorithm based on Hopkins statistic

被引:14
|
作者
Zhang, Ruilin [1 ]
Miao, Zhenguo [1 ]
Tian, Ye [1 ]
Wang, Hongpeng [1 ,2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
关键词
Clustering; Cluster validity index (CVI); Cluster center; Hopkins statistic; Density peaks; FAST SEARCH; NUMBER; FIND;
D O I
10.1016/j.eswa.2022.116892
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Density peaks clustering (DPC) is a promising algorithm due to straightforward and easy implementation. However, most of its improvements still rely on expert, strong prior information, or complex iterations to identify the cluster centers, which inevitably adds subjectivity and instability. Moreover, some crisp and sensitive density metrics will sometimes reduce the representativeness of the center, resulting in poor clustering. To this end, we propose an enhanced algorithm, called Density peaks clustering based on Hopkins Statistic. The main property of the method is to realize the automatic identification of cluster centers without prior information. Specifically, with a two-stage strategy, we first specify some objects as candidate centers by linear regression and residual analysis. Subsequently, inspired by optimization idea we design a novel validity index (AHS) instead of the original decision graph to find the desired centers from the candidates. Another novel part of DPC-AHS is that the proposed adjusted-k-nearest neighbors (A-kNN) dynamically defines the neighbors during the process, which further enhances the robustness against outliers. Finally, we compare performance of DPC-AHS with 7 state-of-the-art methods over synthetic, UCI, and image datasets. Experiments on 25 datasets and in-depth discussion cases from 5 perspectives demonstrate that our algorithm is feasible and effective in clustering and center identification.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] A Novel Density Peaks Clustering Algorithm Based on Local Reachability Density
    Hanqing Wang
    Bin Zhou
    Jianyong Zhang
    Ruixue Cheng
    International Journal of Computational Intelligence Systems, 2020, 13 : 690 - 697
  • [2] A Novel Density Peaks Clustering Algorithm Based on Local Reachability Density
    Wang, Hanqing
    Zhou, Bin
    Zhang, Jianyong
    Cheng, Ruixue
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 690 - 697
  • [3] A Novel Hierarchical Clustering Algorithm Based on Density Peaks for Complex Datasets
    Zhou, Rong
    Zhang, Yong
    Feng, Shengzhong
    Luktarhan, Nurbol
    COMPLEXITY, 2018,
  • [4] A novel density peaks clustering algorithm for mixed data
    Du, Mingjing
    Ding, Shifei
    Xue, Yu
    PATTERN RECOGNITION LETTERS, 2017, 97 : 46 - 53
  • [5] An Improved Density Peaks Clustering Algorithm Based On Density Ratio
    Zou, Yujuan
    Wang, Zhijian
    Xu, Pengfei
    Lv, Taizhi
    COMPUTER JOURNAL, 2024, 67 (07): : 2515 - 2528
  • [6] A spectral clustering algorithm based on attribute fluctuation and density peaks clustering algorithm
    Xin Song
    Shuhua Li
    Ziqiang Qi
    Jianlin Zhu
    Applied Intelligence, 2023, 53 : 10520 - 10534
  • [7] A spectral clustering algorithm based on attribute fluctuation and density peaks clustering algorithm
    Song, Xin
    Li, Shuhua
    Qi, Ziqiang
    Zhu, Jianlin
    APPLIED INTELLIGENCE, 2023, 53 (09) : 10520 - 10534
  • [8] Coflow scheduling algorithm based density peaks clustering
    Li, Chenghao
    Zhang, Huyin
    Zhou, Tianying
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 805 - 813
  • [9] Cosine kernel based density peaks clustering algorithm
    Wang, Jiayuan
    Lv, Li
    Wu, Runxiu
    Fan, Tanghuai
    Lee, Ivan
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2020, 12 (01) : 1 - 20
  • [10] A text clustering algorithm based on find of density peaks
    Liu, Peiyu
    Liu, Yingying
    Hou, Xiuyan
    Li, Qingqing
    Zhu, Zhenfang
    2015 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME), 2015, : 348 - 352