A Distance and Density-based Clustering Algorithm using Automatic Peak Detection

被引:8
|
作者
Zhou, Rong [1 ]
Zhang, Shuang [2 ]
Chen, Chun [1 ]
Ning, Li [1 ]
Zhang, Yong [1 ]
Feng, Shengzhong [1 ]
Liu, Yi [1 ]
Luktarhan, Nurbol [3 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Ctr High Performance Comp, Shenzhen, Peoples R China
[2] Jining 1 Peoples Hosp, Jining, Peoples R China
[3] Xinjiang Univ, Informat Sci & Engn Coll, Urumqi, Peoples R China
关键词
clustering algorithm; distance-based; density-based; density peak;
D O I
10.1109/SmartCloud.2016.39
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Distance-based and density-based clustering algorithms are often used on large spatial and arbitrary shape of data sets. However, some well-known clustering algorithms have troubles when distribution of objects in the dataset varies, and this may lead to a bad clustering result. Such bad performances are more dramatically significant on high-dimensional dataset. Recently, Rodriguez and Laio proposed an efficient clustering algorithm [1] based on two essential indicators: density and distance, which are used to find the cluster centers and play an important role in the process of clustering. However, this algorithm does not work well on high dimensional data sets, since the threshold of cluster centers has been defined ambiguously and hence it has to be decided visually and manually. In this paper, an alternative definition of the indicators is introduced and the threshold of cluster centers is automatically decided by using an improved Canopy algorithm. With fixed centers (each represents a cluster), each remaining data object is assigned to a cluster dependently in a single step. The performance of the algorithm is analyzed on several benchmarks. The experimental results show that (1) the clustering performance on some high dimensional data sets, e.g., intrusion detection, is better; and (2) on low dimensional data sets, the performances are as good as the traditional clustering algorithms.
引用
收藏
页码:176 / 183
页数:8
相关论文
共 50 条
  • [41] An automatic clustering algorithm based on the density-peak framework and Chameleon method
    Liang, Zhou
    Chen, Pei
    PATTERN RECOGNITION LETTERS, 2021, 150 : 40 - 48
  • [42] An automatic clustering algorithm based on the density-peak framework and Chameleon method
    Liang, Zhou
    Chen, Pei
    Pattern Recognition Letters, 2021, 150 : 40 - 48
  • [43] A peak density clustering algorithm based on the automatic selection of the cluster center points
    Cui, Shi-Qi
    Liu, Bing
    Li, Yong
    Liu, Hui
    Journal of Computers (Taiwan), 2020, 31 (06) : 38 - 51
  • [44] An Algorithm to Adaptive Determination of Density Threshold for Density-based Clustering
    Ke, Zhang
    Lei, Huang
    Yi, Chai
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3929 - 3935
  • [45] A Density-based clustering algorithm suitable to various density dataset
    School of Software, Dalian University of Technology, Dalian 116621, China
    J. Comput. Inf. Syst., 2008, 6 (2473-2481):
  • [46] RECOME: A new density-based clustering algorithm using relative KNN kernel density
    Geng, Yangli-ao
    Li, Qingyong
    Zheng, Rong
    Zhuang, Fuzhen
    He, Ruisi
    Xiong, Naixue
    INFORMATION SCIENCES, 2018, 436 : 13 - 30
  • [47] Tabular Data Anomaly Detection Based on Density Peak Clustering Algorithm
    Liang, Dong
    Wang, Jun
    Zhang, Wenping
    Liu, Yuqi
    Wang, Lei
    Zhao, Xiaoyong
    2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 16 - 21
  • [48] Rolling Element Bearing Fault Detection Using Density-Based Clustering
    Tian, Jing
    Azarian, Michael H.
    Pecht, Michael
    2014 IEEE CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (PHM), 2014,
  • [49] A Density-Based Adaptive Distance Fuzzy Clustering Algorithm Based on the Multi-target Traffic Radar
    Zhang, Xinyi
    Cao, Lin
    Wang, Tao
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 511 - 515
  • [50] A GPU-Accelerated Density-Based Clustering Algorithm
    Loh, Woong-Kee
    Kim, Young-Kuk
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 775 - 776