GRIDEN: An effective grid-based and density-based spatial clustering algorithm to support parallel computing

被引:20
|
作者
Deng, Chao [1 ,2 ]
Song, Jinwei [3 ]
Sun, Ruizhi [1 ]
Cai, Saihua [1 ]
Shi, Yinxue [1 ]
机构
[1] China Agr Univ, 17 Tsinghua East Rd, Beijing 100083, Peoples R China
[2] China Tobacco Guangxi Ind Co LTD, 28Beihu South Rd, Nanning 530001, Peoples R China
[3] Chinese Acad Sci, Natl Space Sci Ctr, 1 Nanertiao, Beijing 100190, Peoples R China
关键词
Grid based clustering; Density based clustering; DBSCAN; GRIDEN; Data mining; Massive spatial data; Parallel computing; FAST SEARCH; DBSCAN; PEAKS; FIND;
D O I
10.1016/j.patrec.2017.11.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Density-based clustering has been widely used in many fields. A new effective grid-based and density-based spatial clustering algorithm, GRIDEN, is proposed in this paper, which supports parallel computing in addition to multi-density clustering. It constructs grids using hyper-square cells and provides users with parameter k to control the balance between efficiency and accuracy to increase the flexibility of the algorithm. Compared with conventional density-based algorithms, it achieves much higher performance by eliminating distance calculations among points based on the newly proposed concept of epsilon neighbor cells. Compared with conventional grid-based algorithms, it uses a set of symmetric (2k + 1)(D) cells to identify dense cells and the density-connected relationships among cells. Therefore, the maximum calculated deviation of epsilon-neighbor points in the grid-based algorithm can be controlled to an acceptable level through parameter k. In our experiments, the results demonstrate that GRIDEN can achieve a reliable clustering result that is infinite closed with respect to the exact DBSCAN as parameter k grows, and it requires computational time that is only linear to N. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:81 / 88
页数:8
相关论文
共 50 条
  • [1] Research on application of grid-based and density-based clustering algorithm
    Shen, LX
    Yan, C
    [J]. PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS I AND II, 2003, : 684 - 689
  • [2] A Grid-Based Density Peaks Clustering Algorithm
    Fang, Xintong
    Xu, Zhen
    Ji, Haifeng
    Wang, Baoliang
    Huang, Zhiyao
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (04) : 5476 - 5484
  • [3] Incremental grid density-based clustering algorithm
    Chen, Ning
    Chen, An
    Zhou, Long-Xiang
    [J]. Ruan Jian Xue Bao/Journal of Software, 2002, 13 (01): : 1 - 7
  • [4] Grid-based clustering algorithm for muilti-density
    Qiu, BZ
    Zhang, XZ
    Shen, JY
    [J]. PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 1509 - 1512
  • [5] EDACluster: An evolutionary density and grid-based clustering algorithm
    De Oliveira, Cisar S.
    Godinho, Paulo Igor
    Meiguins, Aruanda S. G.
    Meiguins, Bianchi S.
    Freitas, Alex A.
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2007, : 143 - +
  • [6] PGMCLU: A Novel Parallel Grid-based Clustering Algorithm for Multi-density Datasets
    Chen Xiaoyun
    Chen Yi
    Qi Xiaoli
    Yue Min
    He Yanshan
    [J]. 2009 1ST IEEE SYMPOSIUM ON WEB SOCIETY, PROCEEDINGS, 2009, : 166 - 171
  • [7] K-DBSCAN: An efficient density-based clustering algorithm supports parallel computing
    Deng, Chao
    Song, Jinwei
    Cai, Saihua
    Sun, Ruizhi
    Shi, Yinxue
    Hao, Shangbo
    [J]. International Journal of Simulation and Process Modelling, 2018, 13 (05) : 496 - 505
  • [8] Grid-based clustering algorithm based on intersecting partition and density estimation
    Qiu, Bao-Zhi
    Li, Xiang-Li
    Shen, Jun-Yi
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 368 - +
  • [9] GRIDBSCAN: GRId density-based spatial clustering of applications with noise
    Uncu, Ozge
    Gruver, William A.
    Kotak, Dilip B.
    Sabaz, Dorian
    Alibhai, Zafeer
    Ng, Colin
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 2976 - +
  • [10] A Grid and Density-based Clustering Algorithm for Processing Data Stream
    Jia, Chen
    Tan, ChengYu
    Yong, Ai
    [J]. SECOND INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING: WGEC 2008, PROCEEDINGS, 2008, : 517 - +