STREAMRHF: Tree-Based Unsupervised Anomaly Detection for Data Streams

被引:0
|
作者
Nesic, Stefan [1 ]
Putina, Andrian [1 ]
Bahri, Maroua [2 ]
Huet, Alexis [3 ]
Navarro, Jose Manuel [3 ]
Rossi, Dario [3 ]
Sozio, Mauro [1 ]
机构
[1] Telecom Paris, Paris, France
[2] Inria Paris, Paris, France
[3] Huawei Technol Co Ltd, Paris, France
关键词
Data streams; Unsupervised learning; Anomaly detection; Random histogram;
D O I
10.1109/AICCSA56895.2022.10017876
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present STREAMRHF, an unsupervised anomaly detection algorithm for data streams. Our algorithm builds on some of the ideas of Random Histogram Forest (RHF) [1], a state-of-the-art algorithm for batch unsupervised anomaly detection. STREAMRHF constructs a forest of decision trees, where feature splits are determined according to the kurtosis score of every feature. It irrevocably assigns an anomaly score to data points, as soon as they arrive, by means of an incremental computation of its random trees and the kurtosis scores of the features. This allows efficient online scoring and concept drift detection altogether. Our approach is tree-based which boasts several appealing properties, such as explainability of the results [2]. We conduct an extensive experimental evaluation on multiple datasets from different real-world applications. Our evaluation shows that our streaming algorithm achieves comparable average precision to RHF while outperforming state-of-the-art streaming approaches for unsupervised anomaly detection with furthermore limited computational complexity.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Tree-based indexes for image data
    Brown, L
    Gruenwald, L
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1998, 9 (04) : 300 - 313
  • [42] Tree-based boosting with functional data
    Ju, Xiaomeng
    Salibian-Barrera, Matias
    COMPUTATIONAL STATISTICS, 2024, 39 (03) : 1587 - 1620
  • [43] Tree-based boosting with functional data
    Xiaomeng Ju
    Matías Salibián-Barrera
    Computational Statistics, 2024, 39 : 1587 - 1620
  • [44] Tree-Based Models for Correlated Data
    Rabinowicz, Assaf
    Rosset, Saharon
    Journal of Machine Learning Research, 2022, 23
  • [45] Tree-Based Models for Correlated Data
    Rabinowicz, Assaf
    Rosset, Saharon
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [46] HYPERSPECTRAL ANOMALY DETECTION WITH DATA SPHERING AND UNSUPERVISED TARGET DETECTION
    Chen, Shuhan
    Li, Xiaorun
    Zhao, Liaoying
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1975 - 1978
  • [47] Change detection in data streams through unsupervised learning
    Cabanes, Guenael
    Bennani, Younes
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [48] Unsupervised LSTMs-based Learning for Anomaly Detection in Highway Traffic Data
    Di Mauro, Nicola
    Ferilli, Stefano
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2018), 2018, 11177 : 281 - 290
  • [49] A decision tree-based multimodal data mining framework for soccer goal detection
    Chen, SC
    Shyu, ML
    Chen, M
    Zhang, CC
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 265 - 268
  • [50] Data Streams Anomaly Detection Algorithm Based on Self-set Threshold
    Luo Yuanyan
    Du Xuehui
    Sun Yi
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING (ICCIP 2018), 2018, : 18 - 26