Fast Memory Efficient Local Outlier Detection in Data Streams

被引:108
|
作者
Salehi, Mahsa [1 ,2 ,3 ]
Leckie, Christopher [2 ,3 ]
Bezdek, James C. [2 ,3 ]
Vaithianathan, Tharshan [4 ]
Zhang, Xuyun [5 ]
机构
[1] IBM Res, Melbourne, Vic 3010, Australia
[2] NICTA Victoria, Melbourne, Vic 3010, Australia
[3] Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, Australia
[4] Univ Melbourne, Dept Elect & Elect Engn, Melbourne, Vic 3010, Australia
[5] Univ Auckland, Dept Elect & Comp Engn, Auckland 1010, New Zealand
关键词
Outlier detection; stream data mining; local outlier; memory efficiency;
D O I
10.1109/TKDE.2016.2597833
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outlier detection is an important task in data mining, with applications ranging from intrusion detection to human gait analysis. With the growing need to analyze high speed data streams, the task of outlier detection becomes even more challenging as traditional outlier detection techniques can no longer assume that all the data can be stored for processing. While the well-known Local Outlier Factor (LOF) algorithm has an incremental version, it assumes unbounded memory to keep all previous data points. In this paper, we propose a memory efficient incremental local outlier (MiLOF) detection algorithm for data streams, and a more flexible version (MiLOF_F), both have an accuracy close to Incremental LOF but within a fixed memory bound. Our experimental results show that both proposed approaches have better memory and time complexity than Incremental LOF while having comparable accuracy. In addition, we show that MiLOF_F is robust to changes in the number of data points, the number of underlying clusters and the number of dimensions in the data stream. These results show that MiLOF/MiLOF_F are well suited to application environments with limited memory (e.g., wireless sensor networks), and can be applied to high volume data streams.
引用
收藏
页码:3246 / 3260
页数:15
相关论文
共 50 条
  • [1] A Fast and Efficient Local Outlier Detection in Data Streams
    Yang, Xing
    Zhou, Wenli
    Shu, Nanfei
    Zhang, Hao
    [J]. PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 111 - 116
  • [2] Fast Memory Efficient Local Outlier Detection in Data Streams (Extended Abstract)
    Salehi, Mahsa
    Leckie, Christopher
    Bezdek, James C.
    Vaithianathan, Tharshan
    Zhang, Xuyun
    [J]. 2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 51 - 52
  • [3] DILOF: Effective and Memory Efficient Local Outlier Detection in Data Streams
    Na, Gyoung S.
    Kim, Donghyun
    Yu, Hwanjo
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1993 - 2002
  • [4] CELOF: Effective and fast memory efficient local outlier detection in high-dimensional data streams
    Chen, Liang
    Wang, Wei
    Yang, Yun
    [J]. APPLIED SOFT COMPUTING, 2021, 102
  • [5] CELOF: Effective and fast memory efficient local outlier detection in high-dimensional data streams
    Chen, Liang
    Wang, Wei
    Yang, Yun
    [J]. Applied Soft Computing, 2021, 102
  • [6] A Fast and Efficient Algorithm for Outlier Detection Over Data Streams
    Hassaan, Mosab
    Maher, Hend
    Gouda, Karam
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 749 - 756
  • [7] Incremental local outlier detection for data streams
    Pokrajac, Dragojub
    Lazarevic, Aleksandar
    Latecki, Longin Jan
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 504 - 515
  • [8] A Review of Local Outlier Factor Algorithms for Outlier Detection in Big Data Streams
    Alghushairy, Omar
    Alsini, Raed
    Soule, Terence
    Ma, Xiaogang
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (01) : 1 - 24
  • [9] ELOF: fast and memory-efficient anomaly detection algorithm in data streams
    Yun Yang
    Liang Chen
    ChongJun Fan
    [J]. Soft Computing, 2021, 25 : 4283 - 4294
  • [10] ELOF: fast and memory-efficient anomaly detection algorithm in data streams
    Yang, Yun
    Chen, Liang
    Fan, ChongJun
    [J]. SOFT COMPUTING, 2021, 25 (06) : 4283 - 4294