A Clustering Algorithm for Evolving Data Streams Using Temporal Spatial Hyper Cube

被引:3
|
作者
Al-amri, Redhwan [1 ]
Murugesan, Raja Kumar [1 ]
Almutairi, Mubarak [2 ]
Munir, Kashif [3 ]
Alkawsi, Gamal [4 ]
Baashar, Yahia [5 ]
机构
[1] Taylors Univ, Sch Comp Sci, Subang Jaya 47500, Malaysia
[2] Univ Hafr Albatin 1083, Coll Comp Sci & Engn, Hafr Albatin 31991, Saudi Arabia
[3] Khwaja Fareed Univ Engn & Informat Technol, Dept Comp Sci, Rahim Yar Khan 64200, Pakistan
[4] Univ Tenaga Nasl UNITEN, Inst Sustainable Energy ISE, Kajang 43000, Malaysia
[5] Univ Malaysia Sabah UMS, Fac Comp & Informat, Kota Kinabalu 88400, Sabah, Malaysia
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 13期
关键词
density-based clustering; evolving data stream; temporal spatial hyper cube; DENSITY; CODAS;
D O I
10.3390/app12136523
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
As applications generate massive amounts of data streams, the requirement for ways to analyze and cluster this data has become a critical field of research for knowledge discovery. Data stream clustering's primary objective and goal are to acquire insights into incoming data. Recognizing all possible patterns in data streams that enter at variable rates and structures and evolve over time is critical for acquiring insights. Analyzing the data stream has been one of the vital research areas due to the inevitable evolving aspect of the data stream and its vast application domains. Existing algorithms for handling data stream clustering consider adding various data summarization structures starting from grid projection and ending with buffers of Core-Micro and Macro clusters. However, it is found that the static assumption of the data summarization impacts the quality of clustering. To fill this gap, an online clustering algorithm for handling evolving data streams using a tempo-spatial hyper cube called BOCEDS TSHC has been developed in this research. The role of the tempo-spatial hyper cube (TSHC) is to add more dimensions to the data summarization for more degree of freedom. TSHC when added to Buffer-based Online Clustering for Evolving Data Stream (BOCEDS) results in a superior evolving data stream clustering algorithm. Evaluation based on both the real world and synthetic datasets has proven the superiority of the developed BOCEDS TSHC clustering algorithm over the baseline algorithms with respect to most of the clustering metrics.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] An incremental algorithm for clustering spatial data streams: exploring temporal locality
    Wei, Ling-Yin
    Peng, Wen-Chih
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 37 (02) : 453 - 483
  • [2] An incremental algorithm for clustering spatial data streams: exploring temporal locality
    Ling-Yin Wei
    Wen-Chih Peng
    [J]. Knowledge and Information Systems, 2013, 37 : 453 - 483
  • [3] Statistical hierarchical clustering algorithm for outlier detection in evolving data streams
    Dalibor Krleža
    Boris Vrdoljak
    Mario Brčić
    [J]. Machine Learning, 2021, 110 : 139 - 184
  • [4] Statistical hierarchical clustering algorithm for outlier detection in evolving data streams
    Krleza, Dalibor
    Vrdoljak, Boris
    Brcic, Mario
    [J]. MACHINE LEARNING, 2021, 110 (01) : 139 - 184
  • [5] Dynamically Evolving Clustering for Data Streams
    Baruah, Rashmi Dutta
    Angelov, Plamen
    Baruah, Diganta
    [J]. 2014 IEEE CONFERENCE ON EVOLVING AND ADAPTIVE INTELLIGENT SYSTEMS (EAIS), 2014,
  • [6] A single pass algorithm for clustering evolving data streams based on swarm intelligence
    Agostino Forestiero
    Clara Pizzuti
    Giandomenico Spezzano
    [J]. Data Mining and Knowledge Discovery, 2013, 26 : 1 - 26
  • [7] A single pass algorithm for clustering evolving data streams based on swarm intelligence
    Forestiero, Agostino
    Pizzuti, Clara
    Spezzano, Giandomenico
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 26 (01) : 1 - 26
  • [8] Flock Stream: a Bio-inspired Algorithm for Clustering Evolving Data Streams
    Forestiero, Agostino
    Pizzuti, Clara
    Spezzano, Giandomenico
    [J]. ICTAI: 2009 21ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, 2009, : 1 - 8
  • [9] SPARSE SUBSPACE CLUSTERING FOR EVOLVING DATA STREAMS
    Sui, Jinping
    Liu, Zhen
    Liu, Li
    Jung, Alexander
    Liu, Tianpeng
    Peng, Bo
    Li, Xiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7455 - 7459
  • [10] Online embedding and clustering of evolving data streams
    Zubaroglu, Alaettin
    Atalay, Volkan
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (01) : 29 - 44