A Systematic Review of Density Grid-Based Clustering for Data Streams

Cited by: 14
Authors
Tareq, Mustafa [1]
Sundararajan, Elankovan A. [2]
Harwood, Aaron [3]
Abu Bakar, Azuraliza [4]
Affiliations
[1] Al Hikma Univ Coll, Dept Comp Technol Engn, Baghdad 10015, Iraq
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Ctr Software Technol & Management, Bangi 43600, Selangor, Malaysia
[3] Univ Melbourne, Sch Comp & Informat Sci, Melbourne, Vic 3010, Australia
[4] Univ Kebangsaan Malaysia, Ctr Artificial Intelligence & Technol, Fac Informat Sci & Technol, Bangi 43600, Selangor, Malaysia
Keywords
Clustering algorithms; Data mining; Systematics; Protocols; Databases; Real-time systems; Licenses; Clustering; data stream; grid-based clustering; data stream clustering; density-based clustering; EVOLVING DATA STREAMS; ALGORITHM; FUTURE;
DOI
10.1109/ACCESS.2021.3134704
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Various applications, such as electronic business, satellite remote sensing, intrusion discovery, and network traffic monitoring, generate large unbounded data stream sequences at a rapid pace. The clustering of data streams has attracted considerable interest due to the increasing usage of evolving data streams. In particular, evolving data streams affect clustering because they introduce numerous challenges, such as time and memory limits and one-pass clustering. Furthermore, researchers need to be able to determine arbitrarily shaped clusters present in evolving data streams from applications. Due to these characteristics, conventional density grid-based clustering techniques cannot be used. Moreover, the existing density grid-based clustering algorithms have low cluster quality for clustering evolving data streams. This study conducted a systematic literature review (SLR) and noted numerous research-related issues encountered in solving the aforementioned problems. We summarized numerous grid-based clustering algorithms that have been used and determined their distinctive and limited features. We also observed how these algorithms address the challenges affecting the clustering of evolving data streams and studied their advantages and disadvantages. SLR was based on 104 articles published between 2010 and 2021. Numerous challenges remain for grid-based clustering algorithms, particularly in terms of time-limited and high-dimensional data handling. Last, our findings indicated a variety of active studies on density grid-based clustering.
Pages: 579 - 596
Page count: 18
Related papers
50 in total
  • [41] Gridwave: a grid-based clustering algorithm for market transaction data based on spatial-temporal density-waves and synchronization
    Chao Deng
    Jinwei Song
    Ruizhi Sun
    Saihua Cai
    Yinxue Shi
    Multimedia Tools and Applications, 2018, 77 : 29623 - 29637
  • [42] Gridwave: a grid-based clustering algorithm for market transaction data based on spatial-temporal density-waves and synchronization
    Deng, Chao
    Song, Jinwei
    Sun, Ruizhi
    Cai, Saihua
    Shi, Yinxue
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) : 29623 - 29637
  • [43] Non-parametric grid-based clustering algorithm for remote sensing data
    Pestunov, IA
    Sinyavsky, YN
    Proceedings of the Second IASTED International Multi-Conference on Automation, Control, and Information Technology - Signal and Image Processing, 2005, : 5 - 9
  • [44] Stream Data Clustering Based on Grid Density and Attraction
    Tu, Li
    Chen, Yixin
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (03)
  • [45] Clustering Algorithm Based on Grid and Density for Data Stream
    Wang, Lang
    Li, Haiqing
    MATERIALS SCIENCE, ENERGY TECHNOLOGY, AND POWER ENGINEERING I, 2017, 1839
  • [46] A Research about grid-based spatial clustering method on regional data analysis
    Zhang, Yu-Wei
    Wan, Lu-He
    Journal of Harbin Institute of Technology (New Series), 2011, 18 (SUPPL. 1) : 171 - 175
  • [47] GACH: a grid-based algorithm for hierarchical clustering of high-dimensional data
    Mansoori, Eghbal G.
    SOFT COMPUTING, 2014, 18 (05) : 905 - 922
  • [48] PGMCLU: A Novel Parallel Grid-based Clustering Algorithm for Multi-density Datasets
    Chen Xiaoyun
    Chen Yi
    Qi Xiaoli
    Yue Min
    He Yanshan
    2009 1ST IEEE SYMPOSIUM ON WEB SOCIETY, PROCEEDINGS, 2009, : 166 - 171
  • [49] Density-Based Clustering of Data Streams at Multiple Resolutions
    Wan, Li
    Ng, Wee Keong
    Dang, Xuan Hong
    Yu, Philip S.
    Zhang, Kuan
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (03)
  • [50] Data Streams Clustering Algorithm Based on Grid and Particle Swarm Optimization
    Ke, Luo
    Lin, Wang
    2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 93 - 96