Clustering over Evolving Data Streams Based on Online Recent-Biased Approximation

被引:0
|
作者
Fan, Wei [1 ]
Koyanagi, Yusuke [1 ]
Asakura, Koichi [2 ]
Watanabe, Toyohide [1 ]
机构
[1] Nagoya Univ, Dept Syst & Social Informat, Grad Sch Informat Sci, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648603, Japan
[2] Daido Inst Technol, Sch Informat, Nagoya, Aichi 4578530, Japan
关键词
Clustering over evolving data streams; time series data; recent-biased approximation; data mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A growing number of real world applications deal with multiple evolving data streams. In this paper, a framework for clustering over evolving data streams is proposed taking advantage of recent-biased approximation. In recent-biased approximation, more details are preserved for recent data and fewer coefficients are kept for the whole data stream, which improves the efficiency of clustering and space usability greatly. Our framework consists of two phases. One is an online phase which approximates data streams and maintains the summary statistics incrementally. The other is an offline clustering phase which is able to perform dynamic clustering over data streams on all possible time horizons. As shown in complexity analyses and also validated by our empirical studies, our framework performed efficiently in the data stream environment while producing clustering results of very high quality.
引用
收藏
页码:12 / +
页数:3
相关论文
共 50 条
  • [1] Online embedding and clustering of evolving data streams
    Zubaroglu, Alaettin
    Atalay, Volkan
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (01) : 29 - 44
  • [2] Online Clustering for Evolving Data Streams with Online Anomaly Detection
    Chenaghlou, Milad
    Moshtaghi, Masud
    Leckie, Christopher
    Salehi, Mahsa
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT II, 2018, 10938 : 506 - 519
  • [3] Online Sparse Representation Clustering for Evolving Data Streams
    Chen, Jie
    Yang, Shengxiang
    Fahy, Conor
    Wang, Zhu
    Guo, Yinan
    Chen, Yingke
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [4] Density and sliding window-based clustering over evolving data streams
    Yu, Yanwei
    Zhao, Jindong
    Zhang, Yonggang
    Wen, Changci
    [J]. ICIC Express Letters, Part B: Applications, 2015, 6 (08): : 2275 - 2283
  • [5] Incremental density-based ensemble clustering over evolving data streams
    Khan, Imran
    Huang, Joshua Z.
    Ivanov, Kamen
    [J]. NEUROCOMPUTING, 2016, 191 : 34 - 43
  • [6] Online Clustering of Evolving Data Streams Using a Density Grid-Based Method
    Tareq, Mustafa
    Sundararajan, Elankovan A.
    Mohd, Masnizah
    Sani, Nor Samsiah
    [J]. IEEE ACCESS, 2020, 8 : 166472 - 166490
  • [7] A recent-biased dimension reduction technique for time series data
    Zhao, YC
    Zhang, CQ
    Zhang, SC
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 751 - 757
  • [8] CPOCEDS-concept preserving online clustering for evolving data streams
    Jafseer, K. T.
    Shailesh, S.
    Sreekumar, A.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 2983 - 2998
  • [9] Fully online clustering of evolving data streams into arbitrarily shaped clusters
    Hyde, Richard
    Angelov, Plamen
    MacKenzie, A. R.
    [J]. INFORMATION SCIENCES, 2017, 382 : 96 - 114
  • [10] Clustering Based Active Learning for Evolving Data Streams
    Ienco, Dino
    Bifet, Albert
    Zliobaite, Indre
    Pfahringer, Bernhard
    [J]. DISCOVERY SCIENCE, 2013, 8140 : 79 - 93