An Adaptive Clustering Algorithm Based on Local-Density Peaks for Imbalanced Data Without Parameters

被引:8
|
作者
Tong, Wuning [1 ,2 ]
Wang, Yuping [1 ]
Liu, Delong [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
[2] Shaanxi Univ Chinese Med, Dept Sci & Technol, Xianyang 712046, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering algorithms; Machine learning algorithms; Machine learning; Computer science; Clustering methods; Task analysis; Shape; Data clustering; density peaks; imbalanced data; multiple centers; FAST SEARCH; NEIGHBOR; FIND; NUMBER; RULE;
D O I
10.1109/TKDE.2021.3138962
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data clustering is a challenging problem in machine learning. The main difficulty is caused by the imbalance in both cluster size and data density distribution. To address this problem, we propose a novel clustering algorithm called LDPI based on local-density peaks in this study. First, an initial sub-cluster construction scheme is designed based on a 3-dimensional (3-D) decision graph that can easily detect the initial sub-cluster centers and identify the noise points. Second, a sub-cluster updating strategy is designed, which can automatically identify the false sub-cluster centers and update the initial sub-clusters. Third, a sub-cluster merging scheme is designed, which merges the updated initial sub-clusters into final clusters. Consequently, the proposed algorithm has three advantages: 1) It does not require any input parameters; 2) It can automatically determine the cluster centers and number of clusters; 3) It is suitable for imbalanced datasets and datasets with arbitrary shapes and distributions. The effectiveness of LDPI is demonstrated experimentally and the superiority of LDPI is identified by comparison with 5 state-of-the-art algorithms.
引用
收藏
页码:3419 / 3432
页数:14
相关论文
共 50 条
  • [1] An Ensemble Learning Algorithm Based on Density Peaks Clustering and Fitness for Imbalanced Data
    Xu, Hui
    Liu, Qicheng
    [J]. IEEE ACCESS, 2022, 10 : 116120 - 116128
  • [2] A local-density based spatial clustering algorithm with noise
    Duan, Lian
    Xu, Lida
    Guo, Feng
    Lee, Jun
    Yan, Baopin
    [J]. INFORMATION SYSTEMS, 2007, 32 (07) : 978 - 986
  • [3] OALDPC: oversampling approach based on local density peaks clustering for imbalanced classification
    Li, Junnan
    Zhu, Qingsheng
    [J]. APPLIED INTELLIGENCE, 2023, 53 (24) : 30987 - 31017
  • [4] OALDPC: oversampling approach based on local density peaks clustering for imbalanced classification
    Junnan Li
    Qingsheng Zhu
    [J]. Applied Intelligence, 2023, 53 : 30987 - 31017
  • [5] A Novel Density Peaks Clustering Algorithm Based on Local Reachability Density
    Hanqing Wang
    Bin Zhou
    Jianyong Zhang
    Ruixue Cheng
    [J]. International Journal of Computational Intelligence Systems, 2020, 13 : 690 - 697
  • [6] A Novel Density Peaks Clustering Algorithm Based on Local Reachability Density
    Wang, Hanqing
    Zhou, Bin
    Zhang, Jianyong
    Cheng, Ruixue
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 690 - 697
  • [7] LD-BSCA: A Local-Density Based Spatial Clustering Algorithm
    Wei, Guiyi
    Liu, Haiping
    [J]. 2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, : 291 - 298
  • [8] Hierarchical clustering algorithm based on natural local density peaks
    Cai, Fapeng
    Feng, Ji
    Yang, Degang
    Chen, Zhongshang
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 7989 - 8004
  • [9] An Adaptive Clustering Algorithm by Finding Density Peaks
    Xie, Juanying
    Jiang, Weiliang
    [J]. PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 317 - 325
  • [10] Clustering large spatial data with local-density and its application
    Wei, Guiyi
    Liu, Haiping
    Xie, Mande
    [J]. Information Technology Journal, 2009, 8 (04) : 476 - 485