An Efficient Anomaly Detection Approach Using Cube Sampling with Streaming Data

被引:0
|
作者
Jain, Seemandhar [1 ]
Jain, Prarthi [1 ]
Srivastava, Abhishek [1 ]
机构
[1] IIT Indore, Indore, India
关键词
Anomaly Detection; Isolation Forest; Cube Sampling; Sliding window; Streaming data;
D O I
10.1007/978-3-031-12700-7_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomaly detection is critical in various fields, including intrusion detection, health monitoring, fault diagnosis, and sensor network event detection. The isolation forest (or iForest) approach is a well-known technique for detecting anomalies. It is, however, ineffective when dealing with dynamic streaming data, which is becoming increasingly prevalent in a wide variety of application areas these days. In this work, we extend our previous work by proposed an efficient iForest based approach for anomaly detection using cube sampling that is effective on streaming data. Cube sampling is used in the initial stage to choose nearly balanced samples, significantly reducing storage requirements while preserving efficiency. Following that, the streaming nature of data is addressed by a sliding window technique that generates consecutive chunks of data for systematic processing. The novelty of this paper is in applying Cube sampling in iForest and calculating inclusion probability. The proposed approach is equally successful at detecting anomalies as existing state-of-the-art approaches, requiring significantly less storage and time complexity. We undertake empirical evaluations of the proposed approach using standard datasets and demonstrate that it outperforms traditional approaches in terms of Area Under the ROC Curve (AUC-ROC) and can handle high-dimensional streaming data.
引用
收藏
页码:498 / 505
页数:8
相关论文
共 50 条
  • [41] An Efficient Approach for Anomaly Detection in Traffic Videos
    Doshi, Keval
    Yilmaz, Yasin
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 4231 - 4239
  • [42] Efficient balanced sampling:: The cube method
    Deville, JC
    Tillé, Y
    BIOMETRIKA, 2004, 91 (04) : 893 - 912
  • [43] An efficient data structure for network anomaly detection
    Fan, Jieyan
    Wu, Dapeng
    Lu, Kejie
    Nucci, Antonio
    SECURITY AND COMMUNICATION NETWORKS, 2008, 1 (02) : 107 - 124
  • [44] Super point detection based on sampling and data streaming algorithms
    School of Computer Science and Engineering, Southeast University, Nanjing 210096, China
    不详
    J. Southeast Univ. Engl. Ed., 2009, 2 (224-227):
  • [45] Real-time anomaly detection in gas sensor streaming data
    Wu, Haibo
    Shi, Shiliang
    INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2021, 14 (01) : 81 - 88
  • [46] Querying Streaming System Monitoring Data for Enterprise System Anomaly Detection
    Gao, Peng
    Xiao, Xusheng
    Li, Ding
    Jee, Kangkook
    Chen, Haifeng
    Kulkarni, Sanjeev R.
    Mittal, Prateek
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1774 - 1777
  • [47] Real-time Bayesian anomaly detection in streaming environmental data
    Hill, David J.
    Minsker, Barbara S.
    Amir, Eyal
    WATER RESOURCES RESEARCH, 2009, 45
  • [48] ANOMALY DETECTION AND IDENTIFICATION USING VISUAL TECHNIQUES IN STREAMING VIDEO
    Wanigaaratchi, T. A.
    Vidanagama, V. G. T. N.
    2020 11TH IEEE ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2020, : 45 - 51
  • [49] ADVERSARIAL ANOMALY DETECTION FOR MARKED SPATIO-TEMPORAL STREAMING DATA
    Zhu, Shixiang
    Yuchi, Henry Shaowu
    Xie, Yao
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8921 - 8925
  • [50] A Streaming Data Anomaly Detection Analytic Engine for Mobile Network Management
    Wang, MingXue
    Handurukande, Sidath
    2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 722 - 729