Online Outlier Detection for Data Streams

Citations: 0
Authors
Sadik, Shiblee [1]
Gruenwald, Le [1]
Affiliation
[1] Univ Oklahoma, Norman, OK 73019 USA
Keywords
Knowledge Discovery; Data Mining; Stream Databases;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Outlier detection is a well-established area of statistics, but most existing outlier detection techniques are designed for applications where the entire dataset is available for random access. A typical outlier detection technique constructs a standard data distribution or model and identifies the data points that deviate from the model as outliers. These techniques are not suitable for online data streams, where the entire dataset, due to its unbounded volume, is not available for random access. Moreover, the data distribution in data streams changes over time, which challenges existing outlier detection techniques that assume a constant standard data distribution for the entire dataset. In addition, data streams are characterized by uncertainty, which imposes further complexity. In this paper we propose an adaptive, online outlier detection technique that addresses these characteristics of data streams, called Adaptive Outlier Detection for Data Streams (A-ODDS), which identifies outliers with respect to all the received data points as well as temporally close data points. The temporally close data points are selected based on time and change of data distribution. We also present an efficient, online implementation of the technique and a performance study showing the superiority of A-ODDS over existing techniques in terms of accuracy and execution time on a real-life dataset collected from meteorological applications.
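The idea of checking each arriving point against both all received data and temporally close data can be illustrated with a small sketch. This is not the A-ODDS algorithm, whose details the abstract does not give. It assumes a one-dimensional stream, a running mean and standard deviation (Welford's method) as the global model, a fixed-size sliding window as the set of temporally close points, and a simple k-sigma rule; the class name `StreamOutlierSketch` and the parameters `window` and `threshold` are hypothetical.

```python
import math
from collections import deque


class StreamOutlierSketch:
    """Illustrative only: NOT the A-ODDS method from the paper."""

    def __init__(self, window=50, threshold=3.0):
        self.recent = deque(maxlen=window)  # temporally close points (assumed fixed window)
        self.threshold = threshold          # assumed k-sigma cutoff
        self.n = 0                          # count of all points seen so far
        self.mean = 0.0                     # running mean over all points
        self.m2 = 0.0                       # running sum of squared deviations

    def _deviates(self, x, mean, std):
        return std > 0 and abs(x - mean) > self.threshold * std

    def update(self, x):
        """Score x against past data, then absorb it. Returns True if flagged."""
        global_flag = False
        if self.n >= 2:
            global_std = math.sqrt(self.m2 / (self.n - 1))
            global_flag = self._deviates(x, self.mean, global_std)

        local_flag = False
        if len(self.recent) >= 2:
            m = sum(self.recent) / len(self.recent)
            var = sum((v - m) ** 2 for v in self.recent) / (len(self.recent) - 1)
            local_flag = self._deviates(x, m, math.sqrt(var))

        # Welford's online update of the global statistics (single pass, O(1) memory).
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)
        self.recent.append(x)

        # Assumption: flag only when x deviates from both views of the stream.
        return global_flag and local_flag


# Synthetic readings (not from the paper's dataset): a steady signal, then a spike.
readings = [10.0 + 0.1 * ((i % 3) - 1) for i in range(30)] + [50.0]
detector = StreamOutlierSketch(window=10)
flags = [detector.update(r) for r in readings]
```

Each point is scored before it is added to either model, so a spike cannot mask itself. A fixed window is the simplest stand-in for "temporally close"; the abstract states that A-ODDS selects these points based on time and on change in the data distribution, which this sketch does not attempt.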
Pages: 88-96
Number of pages: 9