Incremental local outlier detection for data streams

被引:214
|
作者
Pokrajac, Dragojub [1 ]
Lazarevic, Aleksandar [2 ]
Latecki, Longin Jan [3 ]
机构
[1] Delaware State Univ, CIS Dept, AMRC, Dover, DE 19901 USA
[2] United Technol Res Ctr, E Hartford, CT 06108 USA
[3] Temple Univ, CIS Dept, Philadelphia, PA 19122 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CIDM.2007.368917
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outlier detection has recently become an important problem in many industrial and financial applications. This problem is further complicated by the fact that in many cases, outliers have to be detected from data streams that arrive at an enormous pace. In this paper, an incremental LOF (Local Outlier Factor) algorithm, appropriate for detecting outliers in data streams, is proposed. The proposed incremental LOF algorithm provides equivalent detection performance as the iterated static LOF algorithm (applied after insertion of each data record), while requiring significantly less computational time. In addition, the incremental LOF algorithm also dynamically updates the profiles of data points. This is a very important property, since data profiles may change over time. The paper provides theoretical evidence that insertion of a new data point as well as deletion of an old data point influence only limited number of their closest neighbors and thus the number of updates per such insertion/deletion does not depend on the total number of points N in the data set. Our experiments performed on several simulated and real life data sets have demonstrated that the proposed incremental LOF algorithm is computationally efficient, while at the same time very successful in detecting outliers and changes of distributional behavior in various data stream applications.
引用
收藏
页码:504 / 515
页数:12
相关论文
共 50 条
  • [1] Improved incremental local outlier detection for data streams based on the landmark window model
    Aihua Li
    Weijia Xu
    Zhidong Liu
    Yong Shi
    [J]. Knowledge and Information Systems, 2021, 63 : 2129 - 2155
  • [2] Improved incremental local outlier detection for data streams based on the landmark window model
    Li, Aihua
    Xu, Weijia
    Liu, Zhidong
    Shi, Yong
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (08) : 2129 - 2155
  • [3] TADILOF: Time Aware Density-Based Incremental Local Outlier Detection in Data Streams
    Huang, Jen-Wei
    Zhong, Meng-Xun
    Jaysawal, Bijay Prasad
    [J]. SENSORS, 2020, 20 (20) : 1 - 25
  • [4] An Incremental Local Outlier Detection Method in the Data Stream
    Yao, Haiqing
    Fu, Xiuwen
    Yang, Yongsheng
    Postolache, Octavian
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (08):
  • [5] A Fast and Efficient Local Outlier Detection in Data Streams
    Yang, Xing
    Zhou, Wenli
    Shu, Nanfei
    Zhang, Hao
    [J]. PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 111 - 116
  • [6] Efficient density and cluster based incremental outlier detection in data streams
    Degirmenci, Ali
    Karal, Omer
    [J]. INFORMATION SCIENCES, 2022, 607 : 901 - 920
  • [7] A Review of Local Outlier Factor Algorithms for Outlier Detection in Big Data Streams
    Alghushairy, Omar
    Alsini, Raed
    Soule, Terence
    Ma, Xiaogang
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (01) : 1 - 24
  • [8] Fast Memory Efficient Local Outlier Detection in Data Streams
    Salehi, Mahsa
    Leckie, Christopher
    Bezdek, James C.
    Vaithianathan, Tharshan
    Zhang, Xuyun
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3246 - 3260
  • [9] iMCOD: Incremental multi-class outlier detection model in data streams
    Degirmenci, Ali
    Karal, Omer
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [10] Robust Incremental Outlier Detection Approach Based on a New Metric in Data Streams
    Degirmenci, Ali
    Karal, Omer
    [J]. IEEE ACCESS, 2021, 9 : 160347 - 160360