Graph Distance and Adaptive K-Nearest Neighbors Selection-Based Density Peak Clustering

被引:0
|
作者
Sun, Yuqin [1 ]
Wang, Jingcong [1 ]
Sun, Yuan [1 ]
Zhang, Pengcheng [1 ]
Wang, Tianyi [1 ]
机构
[1] Shanghai Univ Elect Power, Sch Math & Phys, Shanghai 201306, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金;
关键词
Clustering algorithms; Partitioning algorithms; Resource management; Kernel; Robustness; Manifolds; Euclidean distance; Nearest neighbor methods; Adaptive K-neighbors; allocation strategy; density peak clustering; graph distance; natural neighbors; PATTERNS; SEARCH;
D O I
10.1109/ACCESS.2024.3403128
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Density Peak Clustering (DPC) is known for its rapid identification of cluster centers and successful clustering tasks. However, traditional DPC encounters several issues, which include simplifications in local density and distance metrics, a non-robust single-allocation strategy and limited fault tolerance. To address these challenges, this study introduces an innovative density peak clustering algorithm, named Graph Distance and Adaptive K-Nearest Neighbors Selection-Based Density Peak Clustering (GAK-DPC). Our goal with the approach is to enhance the algorithm's adaptability to non-linear and complex data structures. We achieve this by replacing the traditional Euclidean distance with graph distance. Additionally, we redefine the method for computing local density based on information from K-nearest neighbor data points. By introducing the concept of natural neighbors, the neighborhood radius r is obtained when all instances in the dataset have at least one natural neighbor. Then for the current data point, the number of data points falling within a circle centered on it with radius r is counted as the K-value of that data point. Thus, we achieve the adaptive selection of the K-value. This adaptive K-value strategy takes into account the dataset's characteristics and inter-point neighbor relationships, which enhances the algorithm's adaptability and robustness. Finally, we optimize the secondary allocation strategy for sample points to improve the algorithm's fault tolerance. By conducting comparisons with traditional clustering algorithms on UCI datasets and synthetic datasets, we demonstrate the effectiveness of GAK-DPC.
引用
下载
收藏
页码:71783 / 71796
页数:14
相关论文
共 50 条
  • [1] Adaptive density peak clustering based on K-nearest neighbors with aggregating strategy
    Liu Yaohui
    Ma Zhengming
    Yu Fang
    KNOWLEDGE-BASED SYSTEMS, 2017, 133 : 208 - 220
  • [2] Density Peak Clustering Algorithm Based on K-nearest Neighbors and Optimized Allocation Strategy
    Sun L.
    Qin X.-Y.
    Xu J.-C.
    Xue Z.-A.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (04): : 1390 - 1411
  • [3] Density Peaks Clustering Algorithm Based on Weighted k-Nearest Neighbors and Geodesic Distance
    Liu, Lina
    Yu, Donghua
    IEEE ACCESS, 2020, 8 : 168282 - 168296
  • [4] Relative density based K-nearest neighbors clustering algorithm
    Liu, QB
    Deng, S
    Lu, CH
    Wang, B
    Zhou, YF
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 133 - 137
  • [5] Density peaks clustering based on k-nearest neighbors sharing
    Fan, Tanghuai
    Yao, Zhanfeng
    Han, Longzhe
    Liu, Baohong
    Lv, Li
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (05):
  • [6] Graph Clustering Using Mutual K-Nearest Neighbors
    Sardana, Divya
    Bhatnagar, Raj
    ACTIVE MEDIA TECHNOLOGY, AMT 2014, 2014, 8610 : 35 - 48
  • [7] A novel density peaks clustering algorithm for automatic selection of clustering centers based on K-nearest neighbors
    Wang, Zhihe
    Wang, Huan
    Du, Hui
    Chen, Shiyin
    Shi, Xinxin
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (07) : 11875 - 11894
  • [8] Fault Detection With An Adaptive Distance For The k-Nearest Neighbors Rule
    Verdier, Ghislain
    Ferreira, Ariane
    CIE: 2009 INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2009, : 1273 - 1278
  • [9] K-nearest neighbors in uncertain graph
    Zhang, Yinglong
    Li, Cuiping
    Chen, Hong
    Du, Lingxia
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2011, 48 (10): : 1850 - 1858
  • [10] A grid resource discovery method based on adaptive k-nearest neighbors clustering
    Zhang, Yan
    Jia, Yan
    Huang, Xiaobin
    Zhou, Bin
    Gu, Jian
    COMBINATORIAL OPTIMIZATION AND APPLICATIONS, PROCEEDINGS, 2007, 4616 : 171 - +