Autonomous anomaly detection for streaming data

被引:1
|
作者
Basheer, Muhammad Yunus Iqbal [1 ]
Ali, Azliza Mohd [1 ]
Hamid, Nurzeatul Hamimah Abdul [1 ]
Ariffin, Muhammad Azizi Mohd [1 ]
Osman, Rozianawaty [1 ]
Nordin, Sharifalillah [1 ]
Gu, Xiaowei [2 ]
机构
[1] Univ Teknol MARA, Coll Comp Informat & Math, Sch Comp Sci, Shah Alam, Selangor, Malaysia
[2] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, England
关键词
Anomaly detection; Data stream; Online clustering; AUTOMATIC DETECTION; CLASSIFICATION;
D O I
10.1016/j.knosys.2023.111235
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomaly detection from data streams is a hotly studied topic in the machine learning domain. It is widely considered a challenging task because the underlying patterns exhibited by the streaming data may dynamically change at any time. In this paper, a new algorithm is proposed to detect anomalies autonomously for streaming data. The proposed algorithm is nonparametric and does not require any threshold to be preset by users. The algorithmic procedure of the proposed algorithm is composed of the following three complementary stages. Firstly, the potentially anomalous samples that represent highly different patterns from others are identified from data streams based on data density. Then, these potentially anomalous samples are clustered online using the evolving autonomous data partitioning algorithm. Finally, true anomalies are identified from these minor clusters with the least amounts of samples associated with them. Numerical examples based on three benchmark datasets demonstrated the potential of the proposed algorithm as a highly effective approach for anomaly detection from data streams.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Online data-driven anomaly detection in autonomous robots
    Khalastchi, Eliahu
    Kalech, Meir
    Kaminka, Gal A.
    Lin, Raz
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 43 (03) : 657 - 688
  • [32] Online data-driven anomaly detection in autonomous robots
    Eliahu Khalastchi
    Meir Kalech
    Gal A. Kaminka
    Raz Lin
    [J]. Knowledge and Information Systems, 2015, 43 : 657 - 688
  • [33] Data-Driven Anomaly Detection Approach for Time-Series Streaming Data
    Zhang, Minghu
    Guo, Jianwen
    Li, Xin
    Jin, Rui
    [J]. SENSORS, 2020, 20 (19) : 1 - 17
  • [34] Anomaly detection in streaming environmental sensor data: A data-driven modeling approach
    Hill, David J.
    Minsker, Barbara S.
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2010, 25 (09) : 1014 - 1022
  • [35] PROBABILISTIC REASONING FOR STREAMING ANOMALY DETECTION
    Carter, Kevin M.
    Streilein, William W.
    [J]. 2012 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2012, : 377 - 380
  • [36] SAND: Streaming Subsequence Anomaly Detection
    Boniol, Paul
    Paparrizos, John
    Palpanas, Themis
    Franklin, Michael J.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (10): : 1717 - 1729
  • [37] Anomaly Detection over Streaming Data: Indy500 Case Study
    Widanage, Chathura
    Li, Jiayu
    Tyagi, Sahil
    Teja, Ravi
    Peng, Bo
    Kamburugamuve, Supun
    Koskey, Jon
    Baum, Dan
    Smith, Dayle M.
    Qiu, Judy
    [J]. 2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 9 - 16
  • [38] Anomaly detection in streaming data with gaussian process based stochastic differential equations
    Glyn-Davies, Alex
    Girolami, Mark
    [J]. PATTERN RECOGNITION LETTERS, 2022, 153 : 254 - 260
  • [39] Anomaly detection for streaming data based on grid-clustering and Gaussian distribution
    Zou, Beiji
    Yang, Kangkang
    Kui, Xiaoyan
    Liu, Jun
    Liao, Shenghui
    Zhao, Wei
    [J]. INFORMATION SCIENCES, 2023, 638
  • [40] Real-time anomaly detection using parallelized intrusion detection architecture for streaming data
    Chellammal, P.
    Malarchelvi, Sheba Kezia P. D.
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (04):