A Distance and Density-based Clustering Algorithm using Automatic Peak Detection

被引:8
|
作者
Zhou, Rong [1 ]
Zhang, Shuang [2 ]
Chen, Chun [1 ]
Ning, Li [1 ]
Zhang, Yong [1 ]
Feng, Shengzhong [1 ]
Liu, Yi [1 ]
Luktarhan, Nurbol [3 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Ctr High Performance Comp, Shenzhen, Peoples R China
[2] Jining 1 Peoples Hosp, Jining, Peoples R China
[3] Xinjiang Univ, Informat Sci & Engn Coll, Urumqi, Peoples R China
关键词
clustering algorithm; distance-based; density-based; density peak;
D O I
10.1109/SmartCloud.2016.39
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Distance-based and density-based clustering algorithms are often used on large spatial and arbitrary shape of data sets. However, some well-known clustering algorithms have troubles when distribution of objects in the dataset varies, and this may lead to a bad clustering result. Such bad performances are more dramatically significant on high-dimensional dataset. Recently, Rodriguez and Laio proposed an efficient clustering algorithm [1] based on two essential indicators: density and distance, which are used to find the cluster centers and play an important role in the process of clustering. However, this algorithm does not work well on high dimensional data sets, since the threshold of cluster centers has been defined ambiguously and hence it has to be decided visually and manually. In this paper, an alternative definition of the indicators is introduced and the threshold of cluster centers is automatically decided by using an improved Canopy algorithm. With fixed centers (each represents a cluster), each remaining data object is assigned to a cluster dependently in a single step. The performance of the algorithm is analyzed on several benchmarks. The experimental results show that (1) the clustering performance on some high dimensional data sets, e.g., intrusion detection, is better; and (2) on low dimensional data sets, the performances are as good as the traditional clustering algorithms.
引用
收藏
页码:176 / 183
页数:8
相关论文
共 50 条
  • [31] Community Detection in Complex Networks Using Nonnegative Matrix Factorization and Density-Based Clustering Algorithm
    Lu, Hong
    Zhao, Qinghua
    Sang, Xiaoshuang
    Lu, Jianfeng
    NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1731 - 1748
  • [32] Density-based clustering algorithm for numerical and categorical data with mixed distance measure methods
    Chen, Jin-Yin
    He, Hui-Hao
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2015, 32 (08): : 993 - 1002
  • [33] Towards automatic Eps calculation in density-based clustering
    Gorawski, Marcin
    Malczok, Rafal
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2006, 4152 : 313 - 328
  • [34] Variable Neighborhood Search for Automatic Density-Based Clustering
    Boudane, Fatima
    Berrichi, Ali
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MATHEMATICS AND INFORMATION TECHNOLOGY (ICMIT), 2017, : 141 - 147
  • [35] FDBSCAN-APT: A Fuzzy Density-based Clustering Algorithm with Automatic Parameter Tuning
    Bechini, Alessio
    Criscione, Martina
    Ducange, Pietro
    Marcelloni, Francesco
    Renda, Alessandro
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [36] A novel density-based clustering algorithm using nearest neighbor graph
    Li, Hao
    Liu, Xiaojie
    Li, Tao
    Gan, Rundong
    PATTERN RECOGNITION, 2020, 102
  • [37] Discovering Density-Based Clustering Structures Using Neighborhood Distance Entropy Consistency
    Kamali, Tahereh
    Stashuk, Daniel W.
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (04) : 1069 - 1080
  • [38] Meteor shower detection with density-based clustering
    Sugar, Glenn
    Moorhead, Althea
    Brown, Peter
    Cooke, William
    METEORITICS & PLANETARY SCIENCE, 2017, 52 (06) : 1048 - 1059
  • [39] Unifying Density-Based Clustering and Outlier Detection
    Tao, Yunxin
    Pi, Dechang
    WKDD: 2009 SECOND INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, : 644 - 647
  • [40] Incremental Density-Based Link Clustering Algorithm for Community Detection in Dynamic Networks
    Meng, Fanrong
    Zhang, Feng
    Zhu, Mu
    Xing, Yan
    Wang, Zhixiao
    Shi, Jihong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2016, 2016