STREAMRHF: Tree-Based Unsupervised Anomaly Detection for Data Streams

被引:0
|
作者
Nesic, Stefan [1 ]
Putina, Andrian [1 ]
Bahri, Maroua [2 ]
Huet, Alexis [3 ]
Navarro, Jose Manuel [3 ]
Rossi, Dario [3 ]
Sozio, Mauro [1 ]
机构
[1] Telecom Paris, Paris, France
[2] Inria Paris, Paris, France
[3] Huawei Technol Co Ltd, Paris, France
关键词
Data streams; Unsupervised learning; Anomaly detection; Random histogram;
D O I
10.1109/AICCSA56895.2022.10017876
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present STREAMRHF, an unsupervised anomaly detection algorithm for data streams. Our algorithm builds on some of the ideas of Random Histogram Forest (RHF) [1], a state-of-the-art algorithm for batch unsupervised anomaly detection. STREAMRHF constructs a forest of decision trees, where feature splits are determined according to the kurtosis score of every feature. It irrevocably assigns an anomaly score to data points, as soon as they arrive, by means of an incremental computation of its random trees and the kurtosis scores of the features. This allows efficient online scoring and concept drift detection altogether. Our approach is tree-based which boasts several appealing properties, such as explainability of the results [2]. We conduct an extensive experimental evaluation on multiple datasets from different real-world applications. Our evaluation shows that our streaming algorithm achieves comparable average precision to RHF while outperforming state-of-the-art streaming approaches for unsupervised anomaly detection with furthermore limited computational complexity.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Unsupervised discretization using tree-based density estimation
    Schmidberger, G
    Frank, E
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 240 - 251
  • [22] TeKET: a Tree-Based Unsupervised Keyphrase Extraction Technique
    Rabby, Gollam
    Azad, Saiful
    Mahmud, Mufti
    Zamli, Kamal Z.
    Rahman, Mohammed Mostafizur
    COGNITIVE COMPUTATION, 2020, 12 (04) : 811 - 833
  • [23] Cardiac anomaly detection based on time and frequency domain features using tree-based classifiers
    Kropf, M.
    Hayn, D.
    Morris, D.
    Radhakrishnan, Aravind-Kumar
    Belyayskiy, E.
    Frydas, A.
    Pieske-Kraigher, E.
    Pieske, B.
    Schreier, G.
    PHYSIOLOGICAL MEASUREMENT, 2018, 39 (11)
  • [24] Tree-based Self-adaptive Anomaly Detection by Human-Machine Interaction
    Li, Qingyang
    Yu, Zhiwen
    Xu, Huang
    Guo, Bin
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2021, : 213 - 218
  • [25] Unsupervised density-based behavior change detection in data streams
    Vallim, Rosane M. M.
    Andrade Filho, Jose A.
    de Mello, Rodrigo F.
    de Carvalho, Andre C. P. L. F.
    Gama, Joao
    INTELLIGENT DATA ANALYSIS, 2014, 18 (02) : 181 - 201
  • [26] Anomaly Detection Guidelines for Data Streams in Big Data
    Rana, Annie Ibrahim
    Estrada, Giovani
    Sole, Marc
    Muntes, Victor
    2016 3RD INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2016), 2016, : 94 - 98
  • [27] Anomaly detection in injection molding process data based on unsupervised learning
    Schiffers, Reinhard
    Morik, Katharina
    Struchtrup, Alexander Schulze
    Honysz, Philipp-Jan
    Wortberg, Johannes
    Zeitschrift Kunststofftechnik/Journal of Plastics Technology, 2019, 2019 (05): : 301 - 347
  • [28] Unsupervised Anomaly Detection in Sequential Process Data
    Bulut, Okan
    Gorgun, Guher
    He, Surina
    ZEITSCHRIFT FUR PSYCHOLOGIE-JOURNAL OF PSYCHOLOGY, 2024, 232 (02): : 74 - 94
  • [29] Unsupervised Anomaly Detection in Data Quality Control
    Poon, Lex
    Farshidi, Siamak
    Li, Na
    Zhao, Zhiming
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 2327 - 2336
  • [30] Transformer-Based Method for Unsupervised Anomaly Detection of Flight Data
    Yu, Hao
    Wu, Honglan
    Sun, Youchao
    Liu, Hao
    2023 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL I, APISAT 2023, 2024, 1050 : 1816 - 1826