Density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets

Cited by: 19
Authors
Zhao, Jia [1]
Wang, Gang [1]
Pan, Jeng-Shyang [2]
Fan, Tanghuai [1]
Lee, Ivan [3]
Affiliations
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330099, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[3] Univ South Australia, UniSA STEM, Adelaide, SA 5000, Australia
Funding
National Natural Science Foundation of China;
Keywords
Uneven density data; Density peaks clustering; Fuzzy neighborhood; K-nearest neighbor; Weighted shared neighbor;
DOI
10.1016/j.patcog.2023.109406
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Uneven density data refers to data in which sample density differs noticeably between clusters. The local density used by the density peaks clustering algorithm (DPC) does not account for these density differences between clusters, which may lead to the wrong selection of cluster centers; in addition, the algorithm's allocation strategy tends to assign samples that belong to sparse clusters to dense clusters, which degrades clustering performance. In this study, we propose a density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets (DPC-FWSN). First, a nearest neighbor fuzzy kernel function is obtained by combining K-nearest neighbor and fuzzy neighborhood. Then, local density is redefined using the nearest neighbor fuzzy kernel function. By balancing the contribution of sample density in dense and sparse areas, the redefined local density better characterizes the distribution of the samples and avoids the situation in which a sparse cluster has no cluster center. Finally, an allocation strategy based on weighted shared neighbor similarity is proposed to optimize sample allocation at the boundary of sparse clusters. DPC-FWSN is compared with IDPC-FA, FKNN-DPC, FNDPC, DPCSA and DPC on uneven density datasets, datasets with complex morphologies, and real datasets. The clustering results demonstrate that DPC-FWSN effectively handles datasets with uneven density distribution. © 2023 Elsevier Ltd. All rights reserved.
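As a rough illustration of the problem the abstract describes, the sketch below computes the two quantities of baseline DPC (local density and distance to the nearest denser sample) and contrasts them with a density restricted to the K nearest neighbors. The function names, the Gaussian kernel with cutoff `dc`, and the `knn_density` formula are assumptions made for illustration only; this record does not give the DPC-FWSN kernel or its weighted shared neighbor similarity, and the code does not reproduce them.

```python
import numpy as np

def pairwise_distances(X):
    """Euclidean distance matrix for the rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def dpc_quantities(X, dc):
    """Baseline DPC: Gaussian-kernel local density and relative distance."""
    D = pairwise_distances(X)
    # Local density; subtract 1 to remove each sample's own contribution.
    rho = np.exp(-(D / dc) ** 2).sum(axis=1) - 1.0
    n = len(X)
    delta = np.empty(n)
    parent = np.full(n, -1)  # nearest sample with higher density
    order = np.argsort(-rho, kind="stable")  # decreasing density
    # The densest sample gets its largest distance to any other sample.
    delta[order[0]] = D[order[0]].max()
    for pos in range(1, n):
        i = order[pos]
        higher = order[:pos]
        j = higher[np.argmin(D[i, higher])]
        delta[i] = D[i, j]
        parent[i] = j
    return rho, delta, parent

def knn_density(X, k):
    """Hypothetical KNN-restricted density (NOT the DPC-FWSN definition)."""
    D = pairwise_distances(X)
    knn_d = np.sort(D, axis=1)[:, 1:k + 1]  # skip the zero self-distance
    return np.exp(-knn_d).sum(axis=1)

# One dense cluster (4 samples) and one sparse cluster (3 samples).
X = np.array([[0, 0], [0.1, 0], [0, 0.1], [0.1, 0.1],
              [5, 5], [6, 5], [5, 6]], dtype=float)
rho, delta, parent = dpc_quantities(X, dc=0.5)
print(rho.round(3), delta.round(3), parent)
print(knn_density(X, k=2).round(3))
```

With a single global cutoff, the samples of the sparse cluster receive densities close to zero, which is the kind of imbalance that can hide a sparse cluster's center. Restricting the sum to each sample's K nearest neighbors narrows that gap, although how DPC-FWSN balances the two is defined in the paper itself.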
Pages: 15
Related papers
50 in total
  • [41] Density peaks clustering based on superior nodes and fuzzy correlation
    Zang, Wenke
    Liu, Xincheng
    Ma, Linlin
    Che, Jing
    Sun, Minghe
    Zhao, Yuzhen
    Liu, Xiyu
    Li, Hui
    INFORMATION SCIENCES, 2024, 672
  • [42] Density peaks clustering algorithm with K-nearest neighbors and weighted similarity
    Zhao J.
    Chen L.
    Wu R.-X.
    Zhang B.
    Han L.-Z.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (12): 2349-2357
  • [43] Weighted Single-Pass Fuzzy c-Means Algorithm Based on Density Peaks
    Li, Yangyang
    Wang, Qi
    Ran, Kun
    Jiao, Licheng
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018: 2214-2217
  • [44] A prototype selection technique based on relative density and density peaks clustering for k nearest neighbor classification
    Xiang, Lina
    INTELLIGENT DATA ANALYSIS, 2023, 27 (03): 675-690
  • [45] Parallel Implementation of Density Peaks Clustering Algorithm Based on Spark
    Liu, Rui
    Li, Xiaoge
    Du, Liping
    Zhi, Shuting
    Wei, Mian
    ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 107: 442-447
  • [46] RFDPC: Density Peaks Clustering Algorithm Based on Resultant Force
    Zhang, Yongzhong
    Huang, Hexiao
    Du, Jie
    Ma, Yan
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [47] An Improvement of Density Peaks Clustering Algorithm Based on KNN and Gravitation
    Sun, Jianyang
    Liu, Guanjun
    2021 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2021), 2021: 234-239
  • [48] GDPC: Gravitation-based Density Peaks Clustering algorithm
    Jiang, Jianhua
    Hao, Dehao
    Chen, Yujun
    Parmar, Milan
    Li, Keqin
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 502: 345-355
  • [49] An Improved Density Peaks-Based Graph Clustering Algorithm
    Chen, Lei
    Zheng, Heding
    Liu, Zhaohua
    Li, Qing
    Guo, Lian
    Liang, Guangsheng
    ADVANCES IN INTERNET, DATA & WEB TECHNOLOGIES (EIDWT-2022), 2022, 118: 68-80
  • [50] DPCG: an efficient density peaks clustering algorithm based on grid
    Xiao Xu
    Shifei Ding
    Mingjing Du
    Yu Xue
    International Journal of Machine Learning and Cybernetics, 2018, 9: 743-754