Dynamically Evolving Clustering for Data Streams

被引:0
|
作者
Baruah, Rashmi Dutta [1 ]
Angelov, Plamen [3 ]
Baruah, Diganta [2 ]
机构
[1] Sikkim Manipal Inst Technol, Dept Comp Sci & Engn, Majitar 737136, Sikkim, India
[2] Sikkim Manipal Inst Technol, Dept Informat Technol, Majitar 737136, Sikkim, India
[3] Univ Lancaster, Sch Comp & Commun, Lancaster LA1 4YW, England
关键词
data streams; evolving clustering; online clustering; incremental clustering;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a new online evolving clustering approach for streaming data is proposed, named Dynamically Evolving Clustering method. The clustering approach attempts to meet the following three key requirements of data stream clustering: (i) fast and memory efficient (ii) adaptive (iii) robust to noise. The proposed clustering approach processes one sample at a time and makes necessary changes to the model and then forgets the processed sample. This feature naturally makes it adaptive to changes in the data pattern. The clustering method considers both distance and weight before generating new clusters. This avoids generation of large number of clusters. Further, to capture the dynamics of the data stream, the weight uses an exponential decay model. Since in data streaming environment, a low density cluster can be outlier points or seed of actual cluster, DEC applies a strategy that enables detecting and removing only those low density clusters that are real outliers. To evaluate the performance of the proposed clustering approach, experiments were conducted using benchmark dataset. The results show that the Dynamically Evolving Clustering approach can separate the data well which are evolving in nature.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] SPARSE SUBSPACE CLUSTERING FOR EVOLVING DATA STREAMS
    Sui, Jinping
    Liu, Zhen
    Liu, Li
    Jung, Alexander
    Liu, Tianpeng
    Peng, Bo
    Li, Xiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7455 - 7459
  • [2] Online embedding and clustering of evolving data streams
    Zubaroglu, Alaettin
    Atalay, Volkan
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (01) : 29 - 44
  • [3] Rapidly Labeling and Tracking Dynamically Evolving Concepts In Data Streams
    Parker, Brandon S.
    Khan, Latifur
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 1161 - 1164
  • [4] Online Sparse Representation Clustering for Evolving Data Streams
    Chen, Jie
    Yang, Shengxiang
    Fahy, Conor
    Wang, Zhu
    Guo, Yinan
    Chen, Yingke
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [5] Robust Clustering for Tracking Noisy Evolving Data Streams
    Nasraoui, Olfa
    Rojas, Carlos
    [J]. PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 619 - 623
  • [6] Clustering Based Active Learning for Evolving Data Streams
    Ienco, Dino
    Bifet, Albert
    Zliobaite, Indre
    Pfahringer, Bernhard
    [J]. DISCOVERY SCIENCE, 2013, 8140 : 79 - 93
  • [7] Time-sensitive clustering evolving textual data streams
    Ammar, Mohamed
    Hidri, Adel
    Sassi Hidri, Minyar
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 63 (1-2) : 25 - 40
  • [8] StreamXM: An Adaptive Partitional Clustering Solution for Evolving Data Streams
    Anderson, Robert
    Koh, Yun Sing
    [J]. BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, 2015, 9263 : 270 - 282
  • [9] Hierarchical clustering for multiple nominal data streams with evolving behaviour
    Sangma, Jerry W.
    Sarkar, Mekhla
    Pal, Vipin
    Agrawal, Amit
    Yogita
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (02) : 1737 - 1761
  • [10] S-RASTER: contraction clustering for evolving data streams
    Ulm, Gregor
    Smith, Simon
    Nilsson, Adrian
    Gustavsson, Emil
    Jirstrand, Mats
    [J]. JOURNAL OF BIG DATA, 2020, 7 (01)