Robust Incremental Outlier Detection Approach Based on a New Metric in Data Streams

被引:9
|
作者
Degirmenci, Ali [1 ]
Karal, Omer [1 ]
机构
[1] Ankara Beyazit Univ AYBU, Dept Elect & Elect Engn, TR-06010 Ankara, Turkey
关键词
Anomaly detection; Measurement; Real-time systems; Labeling; Three-dimensional displays; Memory management; Licenses; Incremental learning; local outlier factor (LOF); new metric; outlier detection; robustness;
D O I
10.1109/ACCESS.2021.3131402
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting outliers in real time from multivariate streaming data is a vital and challenging research topic in many areas. Recently introduced the incremental Local Outlier Factor (iLOF) approach and its variants have received considerable attention as they achieve high detection performance in data streams with varying distributions. However, these iLOF-based approaches still have some major limitations: i) Poor detection in high-dimensional data; ii) The difficulty of determining the proper nearest neighbor number k; iii) Instead of labeling the outlier, assigning a score to each sample that indicates the probability to be an outlier; iv) Inability to detect a long sequence (small cluster) of outliers. This article proposes a new robust outlier detection method (RiLOF) based on iLOF that can effectively overcome these limitations. In the RiLOF method, a novel metric called Median of Nearest Neighborhood Absolute Deviation (MoNNAD) has been developed that uses the median of the local absolute deviation of the samples LOF values. Unlike the previously reported LOF-based approaches, RiLOF is capable of achieving outlier detection in different data stream applications using the same hyperparameters. Extensive experiments performed on 15 different real-world data sets demonstrate that RiLOF remarkably outperforms 12 different state-of-the-art competitors.
引用
收藏
页码:160347 / 160360
页数:14
相关论文
共 50 条
  • [1] Incremental local outlier detection for data streams
    Pokrajac, Dragojub
    Lazarevic, Aleksandar
    Latecki, Longin Jan
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 504 - 515
  • [2] Efficient density and cluster based incremental outlier detection in data streams
    Degirmenci, Ali
    Karal, Omer
    [J]. INFORMATION SCIENCES, 2022, 607 : 901 - 920
  • [3] INCREMENTAL PRINCIPAL COMPONENT ANALYSIS BASED OUTLIER DETECTION METHODS FOR SPATIOTEMPORAL DATA STREAMS
    Bhushan, Alka
    Sharker, Monir H.
    Karimi, Hassan A.
    [J]. ISPRS INTERNATIONAL WORKSHOP ON SPATIOTEMPORAL COMPUTING, 2015, : 67 - 71
  • [4] Improved incremental local outlier detection for data streams based on the landmark window model
    Aihua Li
    Weijia Xu
    Zhidong Liu
    Yong Shi
    [J]. Knowledge and Information Systems, 2021, 63 : 2129 - 2155
  • [5] Improved incremental local outlier detection for data streams based on the landmark window model
    Li, Aihua
    Xu, Weijia
    Liu, Zhidong
    Shi, Yong
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (08) : 2129 - 2155
  • [6] A new local distace-based outlier detection approach for fuzzy data by vertex metric
    Mohseni, Somayeh
    Jahromi, Alireza Fakharzade
    [J]. 2015 2ND INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED ENGINEERING AND INNOVATION (KBEI), 2015, : 550 - 553
  • [7] TADILOF: Time Aware Density-Based Incremental Local Outlier Detection in Data Streams
    Huang, Jen-Wei
    Zhong, Meng-Xun
    Jaysawal, Bijay Prasad
    [J]. SENSORS, 2020, 20 (20) : 1 - 25
  • [8] iMCOD: Incremental multi-class outlier detection model in data streams
    Degirmenci, Ali
    Karal, Omer
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [9] iMCOD: Incremental multi-class outlier detection model in data streams
    Degirmenci, Ali
    Karal, Omer
    [J]. Knowledge-Based Systems, 2022, 258
  • [10] Distance-based Outlier Detection in Data Streams
    Tran, Luan
    Fan, Liyue
    Shahabi, Cyrus
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (12): : 1089 - 1100