STREAMRHF: Tree-Based Unsupervised Anomaly Detection for Data Streams

被引:0
|
作者
Nesic, Stefan [1 ]
Putina, Andrian [1 ]
Bahri, Maroua [2 ]
Huet, Alexis [3 ]
Navarro, Jose Manuel [3 ]
Rossi, Dario [3 ]
Sozio, Mauro [1 ]
机构
[1] Telecom Paris, Paris, France
[2] Inria Paris, Paris, France
[3] Huawei Technol Co Ltd, Paris, France
关键词
Data streams; Unsupervised learning; Anomaly detection; Random histogram;
D O I
10.1109/AICCSA56895.2022.10017876
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present STREAMRHF, an unsupervised anomaly detection algorithm for data streams. Our algorithm builds on some of the ideas of Random Histogram Forest (RHF) [1], a state-of-the-art algorithm for batch unsupervised anomaly detection. STREAMRHF constructs a forest of decision trees, where feature splits are determined according to the kurtosis score of every feature. It irrevocably assigns an anomaly score to data points, as soon as they arrive, by means of an incremental computation of its random trees and the kurtosis scores of the features. This allows efficient online scoring and concept drift detection altogether. Our approach is tree-based which boasts several appealing properties, such as explainability of the results [2]. We conduct an extensive experimental evaluation on multiple datasets from different real-world applications. Our evaluation shows that our streaming algorithm achieves comparable average precision to RHF while outperforming state-of-the-art streaming approaches for unsupervised anomaly detection with furthermore limited computational complexity.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Unsupervised Anomaly Detection on Temporal Multiway Data
    Duc Nguyen
    Phuoc Nguyen
    Kien Do
    Rana, Santu
    Gupta, Sunil
    Truyen Tran
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1059 - 1066
  • [32] SoftPatch: Unsupervised Anomaly Detection with Noisy Data
    Jiang, Xi
    Liu, Jianlin
    Wang, Jinbao
    Nie, Qian
    Wu, Kai
    Liu, Yong
    Wang, Chengjie
    Zheng, Feng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [33] Tree-Based Cost Sensitive Methods for Fraud Detection in Imbalanced Data
    Metzler, Guillaume
    Badiche, Xavier
    Belkasmi, Brahim
    Fromont, Elisa
    Habrard, Amaury
    Sebban, Marc
    ADVANCES IN INTELLIGENT DATA ANALYSIS XVII, IDA 2018, 2018, 11191 : 213 - 224
  • [34] Outlier and anomaly pattern detection on data streams
    Cheong Hee Park
    The Journal of Supercomputing, 2019, 75 : 6118 - 6128
  • [35] Anomaly Detection on Data Streams for Smart Agriculture
    Moso, Juliet Chebet
    Cormier, Stephane
    de Runz, Cyril
    Fouchal, Hacene
    Wandeto, John Mwangi
    AGRICULTURE-BASEL, 2021, 11 (11):
  • [36] OHODIN - Online Anomaly Detection for Data Streams
    Gruhl, Christian
    Tomforde, Sven
    2021 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS COMPANION (ACSOS-C 2021), 2021, : 193 - 197
  • [37] Review of Anomaly Detection Algorithms for Data Streams
    Lu, Tianyuan
    Wang, Lei
    Zhao, Xiaoyong
    APPLIED SCIENCES-BASEL, 2023, 13 (10):
  • [38] Adaptive Anomaly Detection on Network Data Streams
    Riddle-Workman, Elizabeth
    Evangelou, Marina
    Adams, Niall M.
    2018 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2018, : 19 - 24
  • [39] Outlier and anomaly pattern detection on data streams
    Park, Cheong Hee
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (09): : 6118 - 6128
  • [40] Anomaly Detection in Data Streams Based on Graph Coloring Density Coefficients
    Tripathi, Achyut Mani
    Baruah, Rashmi Dutta
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,