Density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets

被引:19
|
作者
Zhao, Jia [1 ]
Wang, Gang [1 ]
Pan, Jeng-Shyang [2 ]
Fan, Tanghuai [1 ]
Lee, Ivan [3 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330099, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[3] Univ South Australia, UniSA STEM, Adelaide, SA 5000, Australia
基金
中国国家自然科学基金;
关键词
Uneven density data; Density peaks clustering; Fuzzy neighborhood; K-nearest neighbor; Weighted shared neighbor;
D O I
10.1016/j.patcog.2023.109406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Uneven density data refers to data with a certain difference in sample density between clusters. The local density of density peaks clustering algorithm (DPC) does not consider the effect of sample den-sity difference between clusters of uneven density data, which may lead to wrong selection of cluster centers; the algorithm allocation strategy makes it easy to incorrectly allocate samples originally belong-ing to sparse clusters to dense clusters, which reduces clustering efficiency. In this study, we proposed the density peaks clustering algorithm based on fuzzy and weighted shared neighbor for uneven density datasets (DPC-FWSN). First, a nearest neighbor fuzzy kernel function is obtained by combining K-nearest neighbor and fuzzy neighborhood. Then, local density is redefined by the nearest neighbor fuzzy ker-nel function. The local density can better characterize the distribution characteristics of the sample by balancing the contribution of sample density in dense and sparse areas, in order to avoid the situation that the sparse cluster does not have a cluster center. Finally, the allocation strategy for weighted shared neighbor similarity is proposed to optimize the sample allocation at the boundary of the sparse cluster. Experiments are performed on IDPC-FA, FKNN-DPC, FNDPC, DPCSA and DPC for uneven density datasets, complex morphologies datasets and real datasets. The clustering results demonstrate that DPC-FWSN ef-fectively handles datasets with uneven density distribution.(c) 2023 Elsevier Ltd. All rights reserved.
引用
下载
收藏
页数:15
相关论文
共 50 条
  • [21] A robust density peaks clustering algorithm using fuzzy neighborhood
    Du, Mingjing
    Ding, Shifei
    Xue, Yu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (07) : 1131 - 1140
  • [22] A Novel Density Peaks Clustering Algorithm Based on Local Reachability Density
    Hanqing Wang
    Bin Zhou
    Jianyong Zhang
    Ruixue Cheng
    International Journal of Computational Intelligence Systems, 2020, 13 : 690 - 697
  • [23] A Novel Density Peaks Clustering Algorithm Based on Local Reachability Density
    Wang, Hanqing
    Zhou, Bin
    Zhang, Jianyong
    Cheng, Ruixue
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 690 - 697
  • [24] Adaptive weighted over-sampling for imbalanced datasets based on density peaks clustering with heuristic filtering
    Tao, Xinmin
    Li, Qing
    Guo, Wenjie
    Ren, Chao
    He, Qing
    Liu, Rui
    Zou, JunRong
    INFORMATION SCIENCES, 2020, 519 : 43 - 73
  • [25] Fast Searching Density Peak Clustering Algorithm Based on Shared Nearest Neighbor and Adaptive Clustering Center
    Lv, Yi
    Liu, Mandan
    Xiang, Yue
    SYMMETRY-BASEL, 2020, 12 (12): : 1 - 26
  • [26] Improved Density Peaks Clustering Based on Natural Neighbor Expanded Group
    Ding, Lin
    Xu, Weihong
    Chen, Yuantao
    COMPLEXITY, 2020, 2020 (2020)
  • [27] Incremental Shared Nearest Neighbor Density-Based Clustering
    Singh, Sumeet
    Awekar, Amit
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1533 - 1536
  • [28] Coflow scheduling algorithm based density peaks clustering
    Li, Chenghao
    Zhang, Huyin
    Zhou, Tianying
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 805 - 813
  • [29] Cosine kernel based density peaks clustering algorithm
    Wang, Jiayuan
    Lv, Li
    Wu, Runxiu
    Fan, Tanghuai
    Lee, Ivan
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2020, 12 (01) : 1 - 20
  • [30] A text clustering algorithm based on find of density peaks
    Liu, Peiyu
    Liu, Yingying
    Hou, Xiuyan
    Li, Qingqing
    Zhu, Zhenfang
    2015 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME), 2015, : 348 - 352