An Efficient Anomaly Detection Approach Using Cube Sampling with Streaming Data

被引:0
|
作者
Jain, Seemandhar [1 ]
Jain, Prarthi [1 ]
Srivastava, Abhishek [1 ]
机构
[1] IIT Indore, Indore, India
关键词
Anomaly Detection; Isolation Forest; Cube Sampling; Sliding window; Streaming data;
D O I
10.1007/978-3-031-12700-7_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomaly detection is critical in various fields, including intrusion detection, health monitoring, fault diagnosis, and sensor network event detection. The isolation forest (or iForest) approach is a well-known technique for detecting anomalies. It is, however, ineffective when dealing with dynamic streaming data, which is becoming increasingly prevalent in a wide variety of application areas these days. In this work, we extend our previous work by proposed an efficient iForest based approach for anomaly detection using cube sampling that is effective on streaming data. Cube sampling is used in the initial stage to choose nearly balanced samples, significantly reducing storage requirements while preserving efficiency. Following that, the streaming nature of data is addressed by a sliding window technique that generates consecutive chunks of data for systematic processing. The novelty of this paper is in applying Cube sampling in iForest and calculating inclusion probability. The proposed approach is equally successful at detecting anomalies as existing state-of-the-art approaches, requiring significantly less storage and time complexity. We undertake empirical evaluations of the proposed approach using standard datasets and demonstrate that it outperforms traditional approaches in terms of Area Under the ROC Curve (AUC-ROC) and can handle high-dimensional streaming data.
引用
收藏
页码:498 / 505
页数:8
相关论文
共 50 条
  • [1] An Efficient Anomaly Detection Framework for Electromagnetic Streaming Data
    Sun, Degang
    Hu, Yulan
    Shi, Zhixin
    Xu, Guokun
    Zhou, Wei
    ICBDC 2019: PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON BIG DATA AND COMPUTING, 2019, : 151 - 155
  • [2] ANOMALY DETECTION AND CLASSIFICATION FOR STREAMING DATA USING PDES
    Abbasi, Bilal
    Calder, Jeff
    Oberman, Adam M.
    SIAM JOURNAL ON APPLIED MATHEMATICS, 2018, 78 (02) : 921 - 941
  • [3] Anomaly Detection in Streaming Data using Isolation Forest
    Kareem, Mohammed Shaker
    Muhammed, Lamia AbedNoor
    PROCEEDINGS 2024 SEVENTH INTERNATIONAL WOMEN IN DATA SCIENCE CONFERENCE AT PRINCE SULTAN UNIVERSITY, WIDS-PSU 2024, 2024, : 223 - 228
  • [4] Autonomous anomaly detection for streaming data
    Basheer, Muhammad Yunus Iqbal
    Ali, Azliza Mohd
    Hamid, Nurzeatul Hamimah Abdul
    Ariffin, Muhammad Azizi Mohd
    Osman, Rozianawaty
    Nordin, Sharifalillah
    Gu, Xiaowei
    KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [5] Anomaly pattern detection for streaming data
    Kim, Taegong
    Park, Cheong Hee
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 149
  • [6] Anomaly and Degradation Detection Using Subspace Tracking in Streaming Data
    Cha, Kyungduck
    Sadek, Carol
    Asgharzadeh, Zohreh
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 3476 - 3481
  • [7] Data-Driven Anomaly Detection Approach for Time-Series Streaming Data
    Zhang, Minghu
    Guo, Jianwen
    Li, Xin
    Jin, Rui
    SENSORS, 2020, 20 (19) : 1 - 17
  • [8] Towards Efficient Data Sampling for Temporal Anomaly Detection in Sensor Networks
    El Sibai, Rayane
    Chabchoub, Yousra
    Abou Jaoude, Chady
    Demerjian, Jacques
    Togbe, Maurras
    2019 2ND IEEE MIDDLE EAST AND NORTH AFRICA COMMUNICATIONS CONFERENCE (IEEEMENACOMM'19), 2019, : 111 - 116
  • [9] Anomaly detection in streaming environmental sensor data: A data-driven modeling approach
    Hill, David J.
    Minsker, Barbara S.
    ENVIRONMENTAL MODELLING & SOFTWARE, 2010, 25 (09) : 1014 - 1022
  • [10] Advanced Memory Efficient Outlier Detection Approach for Streaming Data using Swarm Optimization
    Karale, Ankita
    Lazarova, Milena
    Koleva, Pavlina
    Poulkov, Vladimir
    2021 44TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2021, : 346 - 351