An Adaptive Clustering Algorithm Based on Local-Density Peaks for Imbalanced Data Without Parameters

被引:8
|
作者
Tong, Wuning [1 ,2 ]
Wang, Yuping [1 ]
Liu, Delong [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
[2] Shaanxi Univ Chinese Med, Dept Sci & Technol, Xianyang 712046, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering algorithms; Machine learning algorithms; Machine learning; Computer science; Clustering methods; Task analysis; Shape; Data clustering; density peaks; imbalanced data; multiple centers; FAST SEARCH; NEIGHBOR; FIND; NUMBER; RULE;
D O I
10.1109/TKDE.2021.3138962
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data clustering is a challenging problem in machine learning. The main difficulty is caused by the imbalance in both cluster size and data density distribution. To address this problem, we propose a novel clustering algorithm called LDPI based on local-density peaks in this study. First, an initial sub-cluster construction scheme is designed based on a 3-dimensional (3-D) decision graph that can easily detect the initial sub-cluster centers and identify the noise points. Second, a sub-cluster updating strategy is designed, which can automatically identify the false sub-cluster centers and update the initial sub-clusters. Third, a sub-cluster merging scheme is designed, which merges the updated initial sub-clusters into final clusters. Consequently, the proposed algorithm has three advantages: 1) It does not require any input parameters; 2) It can automatically determine the cluster centers and number of clusters; 3) It is suitable for imbalanced datasets and datasets with arbitrary shapes and distributions. The effectiveness of LDPI is demonstrated experimentally and the superiority of LDPI is identified by comparison with 5 state-of-the-art algorithms.
引用
收藏
页码:3419 / 3432
页数:14
相关论文
共 50 条
  • [21] An Improved Density Peaks Clustering Algorithm Based On Density Ratio
    Zou, Yujuan
    Wang, Zhijian
    Xu, Pengfei
    Lv, Taizhi
    [J]. COMPUTER JOURNAL, 2024, 67 (07): : 2515 - 2528
  • [22] A novel density peaks clustering with sensitivity of local density and density-adaptive metric
    Du, Mingjing
    Ding, Shifei
    Xue, Yu
    Shi, Zhongzhi
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 59 (02) : 285 - 309
  • [23] A novel density peaks clustering with sensitivity of local density and density-adaptive metric
    Mingjing Du
    Shifei Ding
    Yu Xue
    Zhongzhi Shi
    [J]. Knowledge and Information Systems, 2019, 59 : 285 - 309
  • [24] A spectral clustering algorithm based on attribute fluctuation and density peaks clustering algorithm
    Xin Song
    Shuhua Li
    Ziqiang Qi
    Jianlin Zhu
    [J]. Applied Intelligence, 2023, 53 : 10520 - 10534
  • [25] A spectral clustering algorithm based on attribute fluctuation and density peaks clustering algorithm
    Song, Xin
    Li, Shuhua
    Qi, Ziqiang
    Zhu, Jianlin
    [J]. APPLIED INTELLIGENCE, 2023, 53 (09) : 10520 - 10534
  • [26] Clustering based on local density peaks and graph cut
    Long, Zhiguo
    Gao, Yang
    Meng, Hua
    Yao, Yuqin
    Li, Tianrui
    [J]. INFORMATION SCIENCES, 2022, 600 : 263 - 286
  • [27] Local-Density Subspace Distributed Clustering for High-Dimensional Data
    Geng, Yangli-ao
    Li, Qingyong
    Liang, Mingfei
    Chi, Chong-Yung
    Tan, Juan
    Huang, Heng
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (08) : 1799 - 1814
  • [28] Density peaks clustering algorithm with connected local density and punished relative distance
    Xiong, Jingwen
    Zang, Wenke
    Zhao, Yuzhen
    Liu, Xiyu
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (05): : 6140 - 6168
  • [29] Density peaks clustering algorithm with connected local density and punished relative distance
    Jingwen Xiong
    Wenke Zang
    Yuzhen Zhao
    Xiyu Liu
    [J]. The Journal of Supercomputing, 2024, 80 : 6140 - 6168
  • [30] Coflow scheduling algorithm based density peaks clustering
    Li, Chenghao
    Zhang, Huyin
    Zhou, Tianying
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 805 - 813