ELOF: fast and memory-efficient anomaly detection algorithm in data streams

被引:4
|
作者
Yang, Yun [1 ]
Chen, Liang [2 ]
Fan, ChongJun [1 ]
机构
[1] Univ Shanghai Sci & Technol, Business Sch, Shanghai, Peoples R China
[2] East China Normal Univ, Shanghai, Peoples R China
关键词
Anomaly detection; LOF; ELOF; Data stream; LOCAL OUTLIER DETECTION;
D O I
10.1007/s00500-020-05442-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomaly detection in multivariate data is an import research field. Many studies have been proposed aiming to develop the local outlier factor (LOF). However, the existing LOF-based models have two major problems: (1) need a large amount of memory to store data; (2) unsatisfactory detection results in high-dimensional data. To this end, we propose a new data streams anomaly detection algorithm extract local outlier factor (ELOF). To reduce data storage, we first design a memory window mechanism to limit the amount of data storage; then, we design a new sub-data extraction model to extract the sub-data of the original data information. Through these two designs, the amount of data storage can be effectively reduced. Moreover, the model framework is based on the density discriminant method, and it can be widely applied to different real scenarios without any prior information or assumptions of data distribution. The final comprehensive experimental results show that the ELOF model has a great improvement than many common models in terms of accuracy. Furthermore, the running time of ELOF algorithm is less than 1% of the original LOF algorithm under the same data set. These results indicate that the ELOF improved model consumes less memory in real-time data streams anomaly detection and works better in high-dimensional data streams detection.
引用
收藏
页码:4283 / 4294
页数:12
相关论文
共 50 条
  • [1] ELOF: fast and memory-efficient anomaly detection algorithm in data streams
    Yun Yang
    Liang Chen
    ChongJun Fan
    [J]. Soft Computing, 2021, 25 : 4283 - 4294
  • [2] Fast Memory-efficient Anomaly Detection in Streaming Heterogeneous Graphs
    Manzoor, Emaad
    Milajerdi, Sadegh M.
    Akoglu, Leman
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1035 - 1044
  • [3] A Memory-Efficient Data Redistribution Algorithm
    Siegel, Stephen F.
    Siegel, Andrew R.
    [J]. RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2009, 5759 : 219 - +
  • [4] A memory-efficient and fast Huffman decoding algorithm
    Chen, HC
    Wang, YL
    Lan, YF
    [J]. INFORMATION PROCESSING LETTERS, 1999, 69 (03) : 119 - 122
  • [5] CICLAD: A Fast and Memory-efficient Closed Itemset Miner for Streams
    Martin, Tomas
    Francoeur, Guy
    Valtchev, Petko
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1810 - 1818
  • [6] Fast Memory Efficient Local Outlier Detection in Data Streams
    Salehi, Mahsa
    Leckie, Christopher
    Bezdek, James C.
    Vaithianathan, Tharshan
    Zhang, Xuyun
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3246 - 3260
  • [7] A Fast and Efficient Algorithm for Outlier Detection Over Data Streams
    Hassaan, Mosab
    Maher, Hend
    Gouda, Karam
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 749 - 756
  • [8] A Fast and Memory-Efficient Hierarchical Graph Clustering Algorithm
    Szilagyi, Laszlo
    Szilagyi, Sandor Miklos
    Hirsbrunner, Beat
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2014), PT I, 2014, 8834 : 247 - 254
  • [9] A FAST AND MEMORY-EFFICIENT ALGORITHM FOR ROBUST PCA (MEROP)
    Narayanamurthy, Praneeth
    Vaswani, Namrata
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4684 - 4688
  • [10] A fast and memory-efficient hierarchical graph clustering algorithm
    Szilágyi, László
    Szilágyi, Sándor Miklós
    Hirsbrunner, Béat
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8834 : 247 - 254