Online Clustering for Evolving Data Streams with Online Anomaly Detection

Cited by: 21
Authors
Chenaghlou, Milad [1]
Moshtaghi, Masud [1]
Leckie, Christopher [1]
Salehi, Mahsa [2]
Affiliations
[1] Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, Australia
[2] Monash Univ, Fac Informat Technol, Melbourne, Vic 3168, Australia
DOI
10.1007/978-3-319-93037-4_40
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Clustering data streams is an emerging challenge with a wide range of applications in areas including Wireless Sensor Networks, the Internet of Things, finance and social media. In an evolving data stream, a clustering algorithm is desired to both (a) assign observations to clusters and (b) identify anomalies in real-time. Current state-of-the-art algorithms in the literature do not address feature (b) as they only consider the spatial proximity of data, which results in (1) poor clustering and (2) poor demonstration of the temporal evolution of data in noisy environments. In this paper, we propose an online clustering algorithm that considers the temporal proximity of observations as well as their spatial proximity to identify anomalies in real-time. It identifies the evolution of clusters in noisy streams, incrementally updates the model and calculates the minimum window length over the evolving data stream without jeopardizing performance. To the best of our knowledge, this is the first online clustering algorithm that identifies anomalies in real-time and discovers the temporal evolution of clusters. Our contributions are supported by synthetic as well as real-world data experiments.
Pages: 506-519
Page count: 14
Related papers
50 in total
  • [41] Online Detection of Patterns in Semantic Trajectory Data Streams
    Roganovic, Milos B.
    Stojanovic, Dragan H.
    [J]. 2013 11TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS IN MODERN SATELLITE, CABLE AND BROADCASTING SERVICES (TELSIKS), VOLS 1 AND 2, 2013, : 575 - 578
  • [42] Anomaly detection of online monitoring data of power equipment based on association rules and clustering algorithm
    Cai, Yu-Xiang
    Cai, Li-Jun
    Lu, Zhou
    [J]. PROCEEDINGS OF THE 2ND ANNUAL INTERNATIONAL CONFERENCE ON ELECTRONICS, ELECTRICAL ENGINEERING AND INFORMATION SCIENCE (EEEIS 2016), 2016, 117 : 289 - 298
  • [43] Dynamically Evolving Clustering for Data Streams
    Baruah, Rashmi Dutta
    Angelov, Plamen
    Baruah, Diganta
    [J]. 2014 IEEE CONFERENCE ON EVOLVING AND ADAPTIVE INTELLIGENT SYSTEMS (EAIS), 2014,
  • [44] ONLINE REACTIVE ANOMALY DETECTION OVER STREAM DATA
    Fu, Yan
    Zhou, Jun-Lin
    Wu, Yue
    [J]. 2008 INTERNATIONAL CONFERENCE ON APPERCEIVING COMPUTING AND INTELLIGENCE ANALYSIS (ICACIA 2008), 2008, : 291 - 294
  • [45] Statistical hierarchical clustering algorithm for outlier detection in evolving data streams
    Dalibor Krleža
    Boris Vrdoljak
    Mario Brčić
    [J]. Machine Learning, 2021, 110 : 139 - 184
  • [46] A Framework for Outlier Detection in Evolving Data Streams by Weighting Attributes in Clustering
    Yogita
    Toshniwal, Durga
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING & SECURITY [ICCCS-2012], 2012, 1 : 214 - 222
  • [47] Statistical hierarchical clustering algorithm for outlier detection in evolving data streams
    Krleza, Dalibor
    Vrdoljak, Boris
    Brcic, Mario
    [J]. MACHINE LEARNING, 2021, 110 (01) : 139 - 184
  • [48] Online Learning and Prediction of Data Streams using Dynamically Evolving Fuzzy Approach
    Baruah, Rashmi Dutta
    Angelov, Plamen
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ - IEEE 2013), 2013,
  • [49] Evolving Spiking Neural Networks for online learning over drifting data streams
    Lobo, Jesus L.
    Lana, Ibai
    Del Ser, Javier
    Bilbao, Miren Nekane
    Kasabov, Nikola
    [J]. NEURAL NETWORKS, 2018, 108 : 1 - 19
  • [50] dSalmon: High-Speed Anomaly Detection for Evolving Multivariate Data Streams
    Hartl, Alexander
    Iglesias, Felix
    Zseby, Tanja
    [J]. PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS, VALUETOOLS 2023, 2024, 539 : 153 - 169