Graph Distance and Adaptive K-Nearest Neighbors Selection-Based Density Peak Clustering

被引:0
|
作者
Sun, Yuqin [1 ]
Wang, Jingcong [1 ]
Sun, Yuan [1 ]
Zhang, Pengcheng [1 ]
Wang, Tianyi [1 ]
机构
[1] Shanghai Univ Elect Power, Sch Math & Phys, Shanghai 201306, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金;
关键词
Clustering algorithms; Partitioning algorithms; Resource management; Kernel; Robustness; Manifolds; Euclidean distance; Nearest neighbor methods; Adaptive K-neighbors; allocation strategy; density peak clustering; graph distance; natural neighbors; PATTERNS; SEARCH;
D O I
10.1109/ACCESS.2024.3403128
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Density Peak Clustering (DPC) is known for its rapid identification of cluster centers and successful clustering tasks. However, traditional DPC encounters several issues, which include simplifications in local density and distance metrics, a non-robust single-allocation strategy and limited fault tolerance. To address these challenges, this study introduces an innovative density peak clustering algorithm, named Graph Distance and Adaptive K-Nearest Neighbors Selection-Based Density Peak Clustering (GAK-DPC). Our goal with the approach is to enhance the algorithm's adaptability to non-linear and complex data structures. We achieve this by replacing the traditional Euclidean distance with graph distance. Additionally, we redefine the method for computing local density based on information from K-nearest neighbor data points. By introducing the concept of natural neighbors, the neighborhood radius r is obtained when all instances in the dataset have at least one natural neighbor. Then for the current data point, the number of data points falling within a circle centered on it with radius r is counted as the K-value of that data point. Thus, we achieve the adaptive selection of the K-value. This adaptive K-value strategy takes into account the dataset's characteristics and inter-point neighbor relationships, which enhances the algorithm's adaptability and robustness. Finally, we optimize the secondary allocation strategy for sample points to improve the algorithm's fault tolerance. By conducting comparisons with traditional clustering algorithms on UCI datasets and synthetic datasets, we demonstrate the effectiveness of GAK-DPC.
引用
下载
收藏
页码:71783 / 71796
页数:14
相关论文
共 50 条
  • [22] Density based clustering algorithm for distributed datasets using mutual K-nearest neighbors
    Salim A.
    International Journal of Advanced Computer Science and Applications, 2019, 10 (03): : 620 - 630
  • [23] Study on density peaks clustering based on k-nearest neighbors and principal component analysis
    Du, Mingjing
    Ding, Shifei
    Jia, Hongjie
    KNOWLEDGE-BASED SYSTEMS, 2016, 99 : 135 - 145
  • [24] AN APPROXIMATE CLUSTERING TECHNIQUE BASED ON THE K-NEAREST NEIGHBORS METHOD
    KOVALENKO, AP
    AUTOMATION AND REMOTE CONTROL, 1992, 53 (10) : 1592 - 1598
  • [25] DPC-DNG: Graph-based label propagation of k-nearest higher-density neighbors for density peaks clustering
    Li, Yan
    Sun, Lingyun
    Tang, Yongchuan
    APPLIED SOFT COMPUTING, 2024, 161
  • [26] An Improved Clustering Algorithm Based on Density Peak and Nearest Neighbors
    Zhao, Chao
    Yang, Junchuang
    Wen, Kexin
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [27] Spectral Clustering Based on k-Nearest Neighbor Graph
    Lucinska, Malgorzata
    Wierzchon, Lawomir T.
    COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT (CISIM), 2012, 7564 : 254 - 265
  • [28] The research on an adaptive k-nearest neighbors classifier
    Yu, Xiao-Gao
    Yu, Xiao-Peng
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1241 - 1246
  • [29] Locally Adaptive Text Classification based k-nearest Neighbors
    Yu, Xiao-gao
    Yu, Xiao-peng
    2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 5651 - +
  • [30] The research on an adaptive k-nearest neighbors classifier
    Yu, Xiaopeng
    Yu, Xiaogao
    PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 535 - 540