An Efficient Anomaly Detection Approach Using Cube Sampling with Streaming Data

Authors
Jain, Seemandhar [1]
Jain, Prarthi [1]
Srivastava, Abhishek [1]
Affiliation
[1] IIT Indore, Indore, India
Keywords
Anomaly Detection; Isolation Forest; Cube Sampling; Sliding Window; Streaming Data
DOI
10.1007/978-3-031-12700-7_51
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Anomaly detection is critical in various fields, including intrusion detection, health monitoring, fault diagnosis, and sensor network event detection. The isolation forest (iForest) approach is a well-known technique for detecting anomalies. It is, however, ineffective on dynamic streaming data, which is becoming increasingly prevalent across a wide variety of application areas. In this work, we extend our previous work by proposing an efficient iForest-based approach for anomaly detection that uses cube sampling and is effective on streaming data. Cube sampling is used in the initial stage to choose nearly balanced samples, significantly reducing storage requirements while preserving efficiency. The streaming nature of the data is then addressed by a sliding window technique that generates consecutive chunks of data for systematic processing. The novelty of this paper lies in applying cube sampling in iForest and calculating the inclusion probability. The proposed approach detects anomalies as successfully as existing state-of-the-art approaches while requiring significantly less storage and time. We undertake empirical evaluations of the proposed approach on standard datasets and demonstrate that it outperforms traditional approaches in terms of Area Under the ROC Curve (AUC-ROC) and can handle high-dimensional streaming data.
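The abstract does not give implementation details, so the following is only a minimal sketch of two of the ingredients it names, not the authors' method. It assumes a fixed window size and step for the sliding window (the names `sliding_windows`, `size`, and `step` are hypothetical), and it uses the standard isolation forest score s(x, n) = 2^(-E[h(x)] / c(n)) from the original iForest formulation. The cube sampling stage and the inclusion probability calculation are not shown.

```python
import math
from collections import deque
from typing import Iterable, Iterator, List

EULER_GAMMA = 0.5772156649  # Euler-Mascheroni constant used to approximate harmonic numbers


def sliding_windows(stream: Iterable[float], size: int, step: int) -> Iterator[List[float]]:
    """Yield consecutive chunks of `size` items, advancing by `step` items each time.

    Hypothetical helper: the window size and step are assumptions, not values from the paper.
    """
    if size <= 0 or step <= 0:
        raise ValueError("size and step must be positive")
    window = deque(maxlen=size)  # keeps only the most recent `size` items
    seen = 0
    for item in stream:
        window.append(item)
        seen += 1
        # Emit once the window first fills, then after every `step` further items.
        if seen >= size and (seen - size) % step == 0:
            yield list(window)


def average_path_length(n: int) -> float:
    """c(n): average path length of an unsuccessful binary search tree lookup over n points."""
    if n <= 1:
        return 0.0
    if n == 2:
        return 1.0
    harmonic = math.log(n - 1) + EULER_GAMMA  # approximation of H(n - 1)
    return 2.0 * harmonic - 2.0 * (n - 1) / n


def anomaly_score(mean_depth: float, n: int) -> float:
    """Standard iForest score: near 1 suggests an anomaly, well below 0.5 suggests a normal point."""
    if n < 2:
        raise ValueError("need at least two points to normalise the path length")
    return 2.0 ** (-mean_depth / average_path_length(n))


if __name__ == "__main__":
    for chunk in sliding_windows(range(6), size=4, step=2):
        print(chunk)
    print(anomaly_score(mean_depth=3.0, n=256))
```

In a streaming setting, each emitted chunk would be sampled and scored in turn, so only one window of data needs to be held in memory at a time; how the paper combines this with cube sampling is not specified in the abstract.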
Pages: 498-505
Page count: 8