Density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets

被引:19
|
作者
Zhao, Jia [1 ]
Wang, Gang [1 ]
Pan, Jeng-Shyang [2 ]
Fan, Tanghuai [1 ]
Lee, Ivan [3 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330099, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[3] Univ South Australia, UniSA STEM, Adelaide, SA 5000, Australia
基金
中国国家自然科学基金;
关键词
Uneven density data; Density peaks clustering; Fuzzy neighborhood; K-nearest neighbor; Weighted shared neighbor;
D O I
10.1016/j.patcog.2023.109406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Uneven density data refers to data with a certain difference in sample density between clusters. The local density of density peaks clustering algorithm (DPC) does not consider the effect of sample den-sity difference between clusters of uneven density data, which may lead to wrong selection of cluster centers; the algorithm allocation strategy makes it easy to incorrectly allocate samples originally belong-ing to sparse clusters to dense clusters, which reduces clustering efficiency. In this study, we proposed the density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets (DPC-FWSN). First, a nearest neighbor fuzzy kernel function is obtained by combining K-nearest neighbor and fuzzy neighborhood. Then, local density is redefined by the nearest neighbor fuzzy ker-nel function. The local density can better characterize the distribution characteristics of the sample by balancing the contribution of sample density in dense and sparse areas, in order to avoid the situation that the sparse cluster does not have a cluster center. Finally, the allocation strategy for weighted shared neighbor similarity is proposed to optimize the sample allocation at the boundary of the sparse cluster. Experiments are performed on IDPC-FA, FKNN-DPC, FNDPC, DPCSA and DPC for uneven density datasets, complex morphologies datasets and real datasets. The clustering results demonstrate that DPC-FWSN ef-fectively handles datasets with uneven density distribution.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Density peaks clustering algorithm with nearest neighbor optimization for data with uneven density distribution
    Chen W.-C.
    Zhao J.
    Xiao R.-B.
    Wang H.
    Cui Z.-H.
    [J]. Kongzhi yu Juece/Control and Decision, 2024, 39 (03): : 919 - 928
  • [2] Density Peaks Clustering Algorithm Based on Shared Neighbor Degree and Probability Assignment
    Zhu, Hongxiang
    Wu, Genxiu
    Wang, Zhaohui
    [J]. Computer Engineering and Applications, 60 (12): : 74 - 90
  • [3] Multiplex neighbor density peaks clustering for uneven density data sets
    Lü, Li
    Zhu, Mei-Zi
    Kang, Ping
    Han, Long-Zhe
    [J]. Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (10): : 1821 - 1830
  • [4] Batch Incremental Shared Nearest Neighbor Density Based Clustering Algorithm for Dynamic Datasets
    Bhattacharjee, Panthadeep
    Awekar, Amit
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 568 - 574
  • [5] Density Peaks Clustering Based on Weighted Local Density Sequence and Nearest Neighbor Assignment
    Yu, Donghua
    Liu, Guojun
    Guo, Maozu
    Liu, Xiaoyan
    Yao, Shuang
    [J]. IEEE ACCESS, 2019, 7 : 34301 - 34317
  • [6] A Weighted Fuzzy Clustering Algorithm Based on Density
    Li, Cuixia
    Tan, Yingjun
    [J]. ADVANCES IN FUTURE COMPUTER AND CONTROL SYSTEMS, VOL 2, 2012, 160 : 205 - +
  • [7] An Improved Clustering Algorithm Based on Density and Shared Nearest Neighbor
    Ye, Hanmin
    Lv, Hao
    Sun, Qianting
    [J]. 2016 IEEE INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2016, : 37 - 40
  • [8] A Novel Hierarchical Clustering Algorithm Based on Density Peaks for Complex Datasets
    Zhou, Rong
    Zhang, Yong
    Feng, Shengzhong
    Luktarhan, Nurbol
    [J]. COMPLEXITY, 2018,
  • [9] Shared-nearest-neighbor-based clustering by fast search and find of density peaks
    Liu, Rui
    Wang, Hong
    Yu, Xiaomei
    [J]. INFORMATION SCIENCES, 2018, 450 : 200 - 226
  • [10] An improved density peaks clustering algorithm based on natural neighbor with a merging strategy
    Ding, Shifei
    Du, Wei
    Xu, Xiao
    Shi, Tianhao
    Wang, Yanru
    Li, Chao
    [J]. INFORMATION SCIENCES, 2023, 624 : 252 - 276