Online Clustering for Evolving Data Streams with Online Anomaly Detection

被引:21
|
作者
Chenaghlou, Milad [1 ]
Moshtaghi, Masud [1 ]
Leckie, Christopher [1 ]
Salehi, Mahsa [2 ]
机构
[1] Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, Australia
[2] Monash Univ, Fac Informat Technol, Melbourne, Vic 3168, Australia
关键词
D O I
10.1007/978-3-319-93037-4_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering data streams is an emerging challenge with a wide range of applications in areas including Wireless Sensor Networks, the Internet of Things, finance and social media. In an evolving data stream, a clustering algorithm is desired to both (a) assign observations to clusters and (b) identify anomalies in real-time. Current state-of-the-art algorithms in the literature do not address feature (b) as they only consider the spatial proximity of data, which results in (1) poor clustering and (2) poor demonstration of the temporal evolution of data in noisy environments. In this paper, we propose an online clustering algorithm that considers the temporal proximity of observations as well as their spatial proximity to identify anomalies in real-time. It identifies the evolution of clusters in noisy streams, incrementally updates the model and calculates the minimum window length over the evolving data stream without jeopardizing performance. To the best of our knowledge, this is the first online clustering algorithm that identifies anomalies in real-time and discovers the temporal evolution of clusters. Our contributions are supported by synthetic as well as real-world data experiments.
引用
收藏
页码:506 / 519
页数:14
相关论文
共 50 条
  • [31] Self-Supervised Learning for Online Anomaly Detection in High-Dimensional Data Streams
    Mozaffari, Mahsa
    Doshi, Keval
    Yilmaz, Yasin
    [J]. ELECTRONICS, 2023, 12 (09)
  • [32] Online k-means Clustering on Arbitrary Data Streams
    Bhattacharjee, Robi
    Imola, Jacob John
    Moshkovitz, Michal
    Dasgupta, Sanjoy
    [J]. INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 204 - 236
  • [33] Adaptive Model Pooling for Online Deep Anomaly Detection from a Complex Evolving Data Stream
    Yoon, Susik
    Lee, Youngjun
    Lee, Jae-Gil
    Lee, Byung Suk
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2347 - 2357
  • [34] Self-adaptive and dynamic clustering for online anomaly detection
    Lee, Seungmin
    Kim, Gisung
    Kim, Sehun
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) : 14891 - 14898
  • [35] A buffer-based online clustering for evolving data stream
    Islam, Md. Kamrul
    Ahmed, Md. Manjur
    Zamli, Kamal Z.
    [J]. INFORMATION SCIENCES, 2019, 489 : 113 - 135
  • [36] Online reliable semi-supervised learning on evolving data streams
    Din, Salah Ud
    Shao, Junming
    Kumar, Jay
    Ali, Waqar
    Liu, Jiaming
    Ye, Yu
    [J]. INFORMATION SCIENCES, 2020, 525 : 153 - 171
  • [37] Online GRNN-Based Ensembles for Regression on Evolving Data Streams
    Duda, Piotr
    Jaworski, Maciej
    Rutkowski, Leszek
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2018, 2018, 10878 : 221 - 228
  • [38] The Analysis of Online Event Streams: Predicting the Next Activity for Anomaly Detection
    Lee, Suhwan
    Lu, Xixi
    Reijer, Hajo A.
    [J]. RESEARCH CHALLENGES IN INFORMATION SCIENCE, 2022, 446 : 248 - 264
  • [39] Online Self-Evolving Anomaly Detection for Reliable Cloud Computing
    Bai, Tianyu
    Wang, Haili
    Guo, Jingda
    Ma, Xu
    Talasila, Mahendra
    Tang, Sihai
    Fu, Song
    Yang, Qing
    [J]. 2022 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC, 2022, : 31 - 40
  • [40] Novelty Detection and Online Learning for Chunk Data Streams
    Wang, Yi
    Ding, Yi
    He, Xiangjian
    Fan, Xin
    Lin, Chi
    Li, Fengqi
    Wang, Tianzhu
    Luo, Zhongxuan
    Luo, Jiebo
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (07) : 2400 - 2412