A novel density peaks clustering algorithm based on Hopkins statistic

被引:14
|
作者
Zhang, Ruilin [1 ]
Miao, Zhenguo [1 ]
Tian, Ye [1 ]
Wang, Hongpeng [1 ,2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
关键词
Clustering; Cluster validity index (CVI); Cluster center; Hopkins statistic; Density peaks; FAST SEARCH; NUMBER; FIND;
D O I
10.1016/j.eswa.2022.116892
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Density peaks clustering (DPC) is a promising algorithm due to straightforward and easy implementation. However, most of its improvements still rely on expert, strong prior information, or complex iterations to identify the cluster centers, which inevitably adds subjectivity and instability. Moreover, some crisp and sensitive density metrics will sometimes reduce the representativeness of the center, resulting in poor clustering. To this end, we propose an enhanced algorithm, called Density peaks clustering based on Hopkins Statistic. The main property of the method is to realize the automatic identification of cluster centers without prior information. Specifically, with a two-stage strategy, we first specify some objects as candidate centers by linear regression and residual analysis. Subsequently, inspired by optimization idea we design a novel validity index (AHS) instead of the original decision graph to find the desired centers from the candidates. Another novel part of DPC-AHS is that the proposed adjusted-k-nearest neighbors (A-kNN) dynamically defines the neighbors during the process, which further enhances the robustness against outliers. Finally, we compare performance of DPC-AHS with 7 state-of-the-art methods over synthetic, UCI, and image datasets. Experiments on 25 datasets and in-depth discussion cases from 5 perspectives demonstrate that our algorithm is feasible and effective in clustering and center identification.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] DPCG: an efficient density peaks clustering algorithm based on grid
    Xiao Xu
    Shifei Ding
    Mingjing Du
    Yu Xue
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 743 - 754
  • [22] An Improved Density Peaks-Based Graph Clustering Algorithm
    Chen, Lei
    Zheng, Heding
    Liu, Zhaohua
    Li, Qing
    Guo, Lian
    Liang, Guangsheng
    ADVANCES IN INTERNET, DATA & WEB TECHNOLOGIES (EIDWT-2022), 2022, 118 : 68 - 80
  • [23] Density Peaks Based Clustering Algorithm for Overlapping Community Detection
    Liu, Hongtao
    Zhao, Chaoyue
    Tian, Yuan
    Yang, Juan
    PROCEEDINGS OF 2016 12TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2016, : 1 - 8
  • [24] An improved density peaks clustering based on sparrow search algorithm
    Chen, Yaru
    Zhou, Jie
    He, Xingshi
    Luo, Xinglong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (08): : 11017 - 11037
  • [25] Density Peaks Clustering Based on Improved RNA Genetic Algorithm
    Ren, Liyan
    Zang, Wenke
    HUMAN CENTERED COMPUTING, HCC 2017, 2018, 10745 : 28 - 33
  • [26] DPCG: an efficient density peaks clustering algorithm based on grid
    Xu, Xiao
    Ding, Shifei
    Du, Mingjing
    Xue, Yu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (05) : 743 - 754
  • [27] Density Peaks Clustering Algorithm Based on K Nearest Neighbors
    Yin, Shihao
    Wu, Runxiu
    Li, Peiwu
    Liu, Baohong
    Fu, Xuefeng
    ADVANCES IN INTELLIGENT SYSTEMS AND COMPUTING (ECC 2021), 2022, 268 : 129 - 144
  • [28] Hierarchical clustering algorithm based on natural local density peaks
    Cai, Fapeng
    Feng, Ji
    Yang, Degang
    Chen, Zhongshang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 7989 - 8004
  • [29] Optimized Density Peaks Clustering Algorithm Based on Dissimilarity Measure
    Ding S.-F.
    Xu X.
    Wang Y.-R.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (11): : 3321 - 3333
  • [30] A novel density deviation multi-peaks automatic clustering algorithm
    Zhou, Wei
    Wang, Limin
    Han, Xuming
    Parmar, Milan
    Li, Mingyang
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (01) : 177 - 211