Efficient density and cluster based incremental outlier detection in data streams

被引:39
|
作者
Degirmenci, Ali [1 ]
Karal, Omer [1 ]
机构
[1] Ankara Yildirim Beyazit Univ, Ayvali Mah 150,Sok Etlik Kecioren, Ankara, Turkey
关键词
LOF; DBSCAN; Outlier detection; Core KNN; Incremental learning; Data stream;
D O I
10.1016/j.ins.2022.06.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a novel, parameter-free, incremental local density and cluster-based outlier factor (iLDCBOF) method is presented that unifies incremental versions of local outlier factor (LOF) and density-based spatial clustering of applications with noise (DBSCAN) to detect outliers efficiently in data streams. The iLDCBOF has many advanced advantages compared to previously reported iLOF-based studies: (1) it is based on a newly developed core k-nearest neighbor (CkNN) concept to reliably and scalably detect outliers from data streams and prevent the clustering of outliers; 2) it uses a newly-developed algorithm that automatically adjusts the value of the k (number of neighbors) parameter for different real-time applications; and 3) it uses the Mahalanobis distance metric, so its performance is not affected even for large amounts of data. The iLDCBOF method is well suited for different data stream applications because it requires no distribution assumptions, it is parameterless (determined automatically), and it is easy to implement. ROC-AUC and statistical test analysis results from extensive experiments performed on 16 different real world datasets showed that the iLDCBOF method significantly outperformed benchmark methods.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:901 / 920
页数:20
相关论文
共 50 条
  • [1] Incremental local outlier detection for data streams
    Pokrajac, Dragojub
    Lazarevic, Aleksandar
    Latecki, Longin Jan
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 504 - 515
  • [2] TADILOF: Time Aware Density-Based Incremental Local Outlier Detection in Data Streams
    Huang, Jen-Wei
    Zhong, Meng-Xun
    Jaysawal, Bijay Prasad
    [J]. SENSORS, 2020, 20 (20) : 1 - 25
  • [3] Robust Incremental Outlier Detection Approach Based on a New Metric in Data Streams
    Degirmenci, Ali
    Karal, Omer
    [J]. IEEE ACCESS, 2021, 9 : 160347 - 160360
  • [4] Improved incremental local outlier detection for data streams based on the landmark window model
    Aihua Li
    Weijia Xu
    Zhidong Liu
    Yong Shi
    [J]. Knowledge and Information Systems, 2021, 63 : 2129 - 2155
  • [5] INCREMENTAL PRINCIPAL COMPONENT ANALYSIS BASED OUTLIER DETECTION METHODS FOR SPATIOTEMPORAL DATA STREAMS
    Bhushan, Alka
    Sharker, Monir H.
    Karimi, Hassan A.
    [J]. ISPRS INTERNATIONAL WORKSHOP ON SPATIOTEMPORAL COMPUTING, 2015, : 67 - 71
  • [6] Improved incremental local outlier detection for data streams based on the landmark window model
    Li, Aihua
    Xu, Weijia
    Liu, Zhidong
    Shi, Yong
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (08) : 2129 - 2155
  • [7] A Fast and Efficient Local Outlier Detection in Data Streams
    Yang, Xing
    Zhou, Wenli
    Shu, Nanfei
    Zhang, Hao
    [J]. PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 111 - 116
  • [8] Outlier detection based on cluster outlier factor and mutual density
    Zhang, Zhongping
    Zhu, Mengfan
    Qiu, Jingyang
    Liu, Cong
    Zhang, Debin
    Qi, Jie
    [J]. International Journal of Intelligent Information and Database Systems, 2019, 12 (1-2) : 91 - 108
  • [9] Outlier detection based on cluster outlier factor and mutual density
    Zhang, Zhongping
    Qiu, Jingyang
    Liu, Cong
    Zhu, Mengfan
    Zhang, Debin
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (09): : 2314 - 2323
  • [10] Fast Memory Efficient Local Outlier Detection in Data Streams
    Salehi, Mahsa
    Leckie, Christopher
    Bezdek, James C.
    Vaithianathan, Tharshan
    Zhang, Xuyun
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3246 - 3260