A Novel Density based Clustering Algorithm and Its Parallelization

被引:13
|
作者
Li, Xiaokang [1 ]
Yu, Binbin [1 ]
Zhou, Yinghua [1 ]
Sun, Guangzhong [1 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Anhui, Peoples R China
关键词
Kmms; clustering algorithm; density; parallelize; INITIALIZATION;
D O I
10.1109/PDCAT.2014.9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
k-Means, a simple but effective clustering algorithm, is widely used in data mining, machine learning and computer vision community. k-Means algorithm consists of initialization of duster centers and iteration. The initial duster centers have a great impact on duster result and algorithm efficiency. More appropriate initial centers of k-Means can get closer to the optimum solution, and even much quicker convergence. In this paper, we propose a novel clustering algorithm, Kmms, which is the abbreviation of k-Means and Mean Shift. It is a density based algorithm. Experiments show our algorithm not only costs less initialization time compared with other density based algorithms, but also achieves better clustering quality and higher efficiency. And compared with the popular k-Means++ algorithm, our method gets comparable accuracy, mostly even better. Furthermore, we parallelize Kmms algorithm based on OPenMP from both initialization and iteration step and prove the convergence of the algorithm.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [31] A novel density peaks clustering algorithm for mixed data
    Du, Mingjing
    Ding, Shifei
    Xue, Yu
    PATTERN RECOGNITION LETTERS, 2017, 97 : 46 - 53
  • [32] A Discretization Algorithm for Meteorological Data and its Parallelization Based on Hadoop
    Liu, Chao
    Jin, Wen
    Yu, Yuting
    Qiu, Taorong
    Bai, Xiaoming
    Zou, Shuilong
    2017 INTERNATIONAL CONFERENCE ON CLOUD TECHNOLOGY AND COMMUNICATION ENGINEERING (CTCE2017), 2017, 910
  • [33] Novel Frequent Pattern Mining Algorithm based on Parallelization scheme
    Gatuha, George
    Jiang, Tao
    INTERNATIONAL JOURNAL OF ENGINEERING RESEARCH IN AFRICA, 2016, 23 : 131 - 140
  • [34] An Expanding Clustering Algorithm Based on Density Searching
    Tan, Liguo
    Liu, Yang
    Chen, Xinglin
    INFORMATION AND MANAGEMENT ENGINEERING, PT VI, 2011, 236 : 110 - 116
  • [35] Clustering Algorithm Based on Characteristics of Density Distribution
    Zheng Hua
    Wang Zhenxing
    Zhang Liancheng
    Wang Qian
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 2, 2010, : 431 - 435
  • [36] A Weighted Fuzzy Clustering Algorithm Based on Density
    Li, Cuixia
    Tan, Yingjun
    ADVANCES IN FUTURE COMPUTER AND CONTROL SYSTEMS, VOL 2, 2012, 160 : 205 - +
  • [37] Clustering algorithm based on density function and nichePSO
    Guo, Chonghui
    Zang, Yunhui
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2012, 23 (03) : 445 - 452
  • [38] Fast density-based clustering algorithm
    Zhou, Shuigeng
    Zhou, Aoying
    Cao, Jing
    Hu, Yunfa
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2000, 37 (11): : 1287 - 1292
  • [39] A Density Clustering Algorithm Based on Data Partitioning
    Li, Dongping
    PROCEEDINGS OF ANNUAL CONFERENCE OF CHINA INSTITUTE OF COMMUNICATIONS, 2010, : 251 - 254
  • [40] piClust: A density based piRNA clustering algorithm
    Jung, Inuk
    Park, Jong Chan
    Kim, Sun
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2014, 50 : 60 - 67