A Systematic Review of Density Grid-Based Clustering for Data Streams

Cited by: 14
Authors
Tareq, Mustafa [1]
Sundararajan, Elankovan A. [2]
Harwood, Aaron [3]
Abu Bakar, Azuraliza [4]
Affiliations
[1] Al Hikma Univ Coll, Dept Comp Technol Engn, Baghdad 10015, Iraq
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Ctr Software Technol & Management, Bangi 43600, Selangor, Malaysia
[3] Univ Melbourne, Sch Comp & Informat Sci, Melbourne, Vic 3010, Australia
[4] Univ Kebangsaan Malaysia, Ctr Artificial Intelligence & Technol, Fac Informat Sci & Technol, Bangi 43600, Selangor, Malaysia
Keywords
Clustering algorithms; Data mining; Systematics; Protocols; Databases; Real-time systems; Licenses; Clustering; data stream; grid-based clustering; data stream clustering; density-based clustering; EVOLVING DATA STREAMS; ALGORITHM; FUTURE;
DOI
10.1109/ACCESS.2021.3134704
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Various applications, such as electronic business, satellite remote sensing, intrusion discovery, and network traffic monitoring, generate large unbounded data stream sequences at a rapid pace. The clustering of data streams has attracted considerable interest due to the increasing usage of evolving data streams. In particular, evolving data streams affect clustering because they introduce numerous challenges, such as time and memory limits and one-pass clustering. Furthermore, researchers need to be able to determine arbitrarily shaped clusters present in evolving data streams from applications. Due to these characteristics, conventional density grid-based clustering techniques cannot be used. Moreover, the existing density grid-based clustering algorithms have low cluster quality for clustering evolving data streams. This study conducted a systematic literature review (SLR) and noted numerous research-related issues encountered in solving the aforementioned problems. We summarized numerous grid-based clustering algorithms that have been used and determined their distinctive and limited features. We also observed how these algorithms address the challenges affecting the clustering of evolving data streams and studied their advantages and disadvantages. The SLR was based on 104 articles published between 2010 and 2021. Numerous challenges remain for grid-based clustering algorithms, particularly in terms of time-limited and high-dimensional data handling. Finally, our findings indicated a variety of active studies on density grid-based clustering.
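As a rough illustration of the ideas the abstract names (one-pass processing, bounded memory, fading densities, and arbitrarily shaped clusters), the following is a minimal sketch of a generic density grid. It is not taken from the reviewed paper or from any specific algorithm it surveys; the class name `DensityGrid`, its parameters (`cell_size`, `decay`, `dense_threshold`), and the sample stream are all assumptions made for this example.

```python
from collections import deque
from itertools import product
import math


class DensityGrid:
    """Toy one-pass density grid; every name and default here is illustrative."""

    def __init__(self, cell_size=1.0, decay=0.99, dense_threshold=2.0):
        self.cell_size = cell_size
        self.decay = decay                      # per-time-step fading factor in (0, 1]
        self.dense_threshold = dense_threshold
        self.cells = {}                         # cell index -> (density, last update time)

    def _decayed(self, cell, now):
        # Fade the stored density by the time elapsed since its last update.
        density, last = self.cells[cell]
        return density * self.decay ** (now - last)

    def insert(self, point, now):
        """Map a point to its cell and update that cell's faded density."""
        cell = tuple(math.floor(x / self.cell_size) for x in point)
        old = self._decayed(cell, now) if cell in self.cells else 0.0
        self.cells[cell] = (old + 1.0, now)     # the raw point is not stored

    def clusters(self, now):
        """Group dense cells that touch (including diagonally) into clusters."""
        dense = {c for c in self.cells
                 if self._decayed(c, now) >= self.dense_threshold}
        seen, result = set(), []
        for start in dense:
            if start in seen:
                continue
            seen.add(start)
            queue, component = deque([start]), set()
            while queue:                        # breadth-first search over dense cells
                cell = queue.popleft()
                component.add(cell)
                for offset in product((-1, 0, 1), repeat=len(cell)):
                    neighbour = tuple(c + o for c, o in zip(cell, offset))
                    if neighbour in dense and neighbour not in seen:
                        seen.add(neighbour)
                        queue.append(neighbour)
            result.append(component)
        return result


# Hypothetical stream: each point is seen once, at time step t.
grid = DensityGrid(cell_size=1.0, decay=0.99, dense_threshold=2.0)
stream = [(0.2, 0.3), (0.4, 0.1), (0.9, 0.8),
          (1.1, 0.2), (1.5, 0.5), (1.7, 0.9), (8.0, 8.0)]
for t, point in enumerate(stream):
    grid.insert(point, t)
found = grid.clusters(now=len(stream))
```

In this sketch each point is read once and then discarded, so memory grows with the number of occupied cells rather than with the length of the stream. Joining adjacent dense cells is what lets cluster shapes follow the data instead of assuming a fixed form. With the sample stream, the two neighbouring dense cells merge into one cluster, while the isolated point at (8.0, 8.0) stays below the threshold.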
Pages: 579-596
Number of pages: 18
Related papers
50 in total
  • [21] An Efficient Grid-based Clustering Method by Finding Density Peaks
    Wu, Bo
    Wilamowski, B. M.
    PROCEEDINGS OF THE IECON 2016 - 42ND ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2016, : 837 - 842
  • [22] Grid-based clustering algorithm based on intersecting partition and density estimation
    Qiu, Bao-Zhi
    Li, Xiang-Li
    Shen, Jun-Yi
    EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 368 - +
  • [23] Flexible grid-based clustering
    Akodjenou-Jeannin, Marc-Ismael
    Salamatian, Kave
    Gallinari, Patrick
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2007, PROCEEDINGS, 2007, 4702 : 350 - +
  • [24] Grid-Based DBSCAN for Clustering Extended Objects in Radar Data
    Kellner, Dominik
    Klappstein, Jens
    Dietmayer, Klaus
    2012 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2012, : 365 - 370
  • [25] The BANG-clustering system: Grid-based data analysis
    Schikuta, E
    Erhart, M
    ADVANCES IN INTELLIGENT DATA ANALYSIS: REASONING ABOUT DATA, 1997, 1280 : 513 - 524
  • [26] Grid-Based and Outlier Detection-Based Data Clustering and Classification
    Cho, Kyu Cheol
    Lee, Jong Sik
    UBIQUITOUS COMPUTING AND MULTIMEDIA APPLICATIONS, PT I, 2011, 150 : 129 - 138
  • [27] Grid-based & Outlier Detection-based Data Clustering & Classification
    Cho, Kyu Cheol
    Lee, Jong Sik
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (03): : 1253 - 1266
  • [28] Grid-based spectral fiber clustering
    Klein, Jan
    Bittihn, Philip
    Ledochowitsch, Peter
    Hahn, Horst K.
    Konrad, Olaf
    Rexilius, Jan
    Peitgen, Heinz-Otto
    MEDICAL IMAGING 2007: VISUALIZATION AND IMAGE-GUIDED PROCEDURES, PTS 1 AND 2, 2007, 6509
  • [29] A NOVEL GRID-BASED CLUSTERING ALGORITHM
    Starczewski, Artur
    Scherer, Magdalena M.
    Ksiazek, Wojciech
    Debski, Maciej
    Wang, Lipo
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2021, 11 (04) : 319 - 330
  • [30] Grid-based dynamic clustering with grid proximity measure
    Lee, Gun Ho
    INTELLIGENT DATA ANALYSIS, 2016, 20 (04) : 853 - 875