Equi-Clustream: a framework for clustering time evolving mixed data

被引:6
|
作者
Sangam, Ravi Sankar [1 ]
Om, Hari [2 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Tadepalligudem 534101, Andhra Prades, India
[2] Indian Inst Technol, Dept Comp Sci & Engn, Indian Sch Mines, Dhanbad 826004, Jharkhand, India
关键词
Clustering; Data streams; Time-evolving data; Data mining; DATA STREAMS; ALGORITHM;
D O I
10.1007/s11634-018-0316-3
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In data stream environment, most of the conventional clustering algorithms are not sufficiently efficient, since large volumes of data arrive in a stream and these data points unfold with time. The problem of clustering time-evolving metric data and categorical time-evolving data has separately been well explored in recent years, but the problem of clustering mixed type time-evolving data remains a challenging issue due to an awkward gap between the structure of metric and categorical attributes. In this paper, we devise a generalized framework, termed Equi-Clustream to dynamically cluster mixed type time-evolving data, which comprises three algorithms: a Hybrid Drifting Concept Detection Algorithm that detects the drifting concept between the current sliding window and previous sliding window, a Hybrid Data Labeling Algorithm that assigns an appropriate cluster label to each data vector of the current non-drifting window based on the clustering result of the previous sliding window, and a visualization algorithm that analyses the relationship between the clusters at different timestamps and also visualizes the evolving trends of the clusters. The efficacy of the proposed framework is shown by experiments on synthetic and real world datasets.
引用
收藏
页码:973 / 995
页数:23
相关论文
共 50 条
  • [31] Design of 2-Level Clustering Framework for Time Series Data Sets
    Thakur, G. S.
    Thakur, R. S.
    Thakur, Ravi Singh
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2011), VOL 2, 2012, 131 : 205 - +
  • [32] Understanding time use via data mining: A clustering-based framework
    Rosales-Salas, Jorge
    Maldonado, Sebastian
    Seret, Alex
    INTELLIGENT DATA ANALYSIS, 2018, 22 (03) : 597 - 616
  • [33] Online Sparse Representation Clustering for Evolving Data Streams
    Chen, Jie
    Yang, Shengxiang
    Fahy, Conor
    Wang, Zhu
    Guo, Yinan
    Chen, Yingke
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 525 - 539
  • [34] Online Sparse Representation Clustering for Evolving Data Streams
    Chen, Jie
    Yang, Shengxiang
    Fahy, Conor
    Wang, Zhu
    Guo, Yinan
    Chen, Yingke
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 525 - 539
  • [35] Evolving Local Means Method for Clustering of Streaming Data
    Baruah, Rashmi Dutta
    Angelov, Plamen
    2012 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2012,
  • [36] Local Motif Clustering on Time-Evolving Graphs
    Fu, Dongqi
    Zhou, Dawei
    He, Jingrui
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 390 - 400
  • [37] Robust Clustering for Tracking Noisy Evolving Data Streams
    Nasraoui, Olfa
    Rojas, Carlos
    PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 619 - 623
  • [38] Clustering Based Active Learning for Evolving Data Streams
    Ienco, Dino
    Bifet, Albert
    Zliobaite, Indre
    Pfahringer, Bernhard
    DISCOVERY SCIENCE, 2013, 8140 : 79 - 93
  • [39] Synchronization-based clustering on evolving data stream
    Shao, Junming
    Tan, Yue
    Gao, Lianli
    Yang, Qinli
    Plant, Claudia
    Assent, Ira
    INFORMATION SCIENCES, 2019, 501 : 573 - 587
  • [40] Incremental Clustering Approach for Evolving Trajectory Data Stream
    Shein, Thi Thi
    Puntheeranurak, Sutheera
    2018 6TH INTERNATIONAL ELECTRICAL ENGINEERING CONGRESS (IEECON), 2018,