Online Clustering for Evolving Data Streams with Online Anomaly Detection

被引:21
|
作者
Chenaghlou, Milad [1 ]
Moshtaghi, Masud [1 ]
Leckie, Christopher [1 ]
Salehi, Mahsa [2 ]
机构
[1] Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, Australia
[2] Monash Univ, Fac Informat Technol, Melbourne, Vic 3168, Australia
关键词
D O I
10.1007/978-3-319-93037-4_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering data streams is an emerging challenge with a wide range of applications in areas including Wireless Sensor Networks, the Internet of Things, finance and social media. In an evolving data stream, a clustering algorithm is desired to both (a) assign observations to clusters and (b) identify anomalies in real-time. Current state-of-the-art algorithms in the literature do not address feature (b) as they only consider the spatial proximity of data, which results in (1) poor clustering and (2) poor demonstration of the temporal evolution of data in noisy environments. In this paper, we propose an online clustering algorithm that considers the temporal proximity of observations as well as their spatial proximity to identify anomalies in real-time. It identifies the evolution of clusters in noisy streams, incrementally updates the model and calculates the minimum window length over the evolving data stream without jeopardizing performance. To the best of our knowledge, this is the first online clustering algorithm that identifies anomalies in real-time and discovers the temporal evolution of clusters. Our contributions are supported by synthetic as well as real-world data experiments.
引用
收藏
页码:506 / 519
页数:14
相关论文
共 50 条
  • [1] Online embedding and clustering of evolving data streams
    Zubaroglu, Alaettin
    Atalay, Volkan
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (01) : 29 - 44
  • [2] Online Sparse Representation Clustering for Evolving Data Streams
    Chen, Jie
    Yang, Shengxiang
    Fahy, Conor
    Wang, Zhu
    Guo, Yinan
    Chen, Yingke
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [3] OHODIN - Online Anomaly Detection for Data Streams
    Gruhl, Christian
    Tomforde, Sven
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS COMPANION (ACSOS-C 2021), 2021, : 193 - 197
  • [4] Clustering Evolving Batch System Jobs for Online Anomaly Detection
    Kuehn, Eileen
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1534 - 1535
  • [5] Online Anomaly Detection over Big Data Streams
    Rettig, Laura
    Khayati, Mourad
    Cudre-Mauroux, Philippe
    Piorkowski, Michal
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1113 - 1122
  • [6] Online Clustering for Topic Detection in Social Data Streams
    Comito, Carmela
    Pizzuti, Clara
    Procopio, Nicola
    [J]. 2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 362 - 369
  • [7] CPOCEDS-concept preserving online clustering for evolving data streams
    Jafseer, K. T.
    Shailesh, S.
    Sreekumar, A.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 2983 - 2998
  • [8] Fully online clustering of evolving data streams into arbitrarily shaped clusters
    Hyde, Richard
    Angelov, Plamen
    MacKenzie, A. R.
    [J]. INFORMATION SCIENCES, 2017, 382 : 96 - 114
  • [9] Online Clustering for Novelty Detection and Concept Drift in Data Streams
    Garcia, Kemilly Dearo
    Poel, Mannes
    Kok, Joost N.
    de Carvalho, Andre C. P. L. F.
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11805 : 448 - 459
  • [10] Online clustering of parallel data streams
    Beringer, Juergen
    Huellermeier, Eyke
    [J]. DATA & KNOWLEDGE ENGINEERING, 2006, 58 (02) : 180 - 204