RADE: resource-efficient supervised anomaly detection using decision tree-based ensemble methods

被引:8
|
作者
Vargaftik, Shay [1 ]
Keslassy, Isaac [1 ,2 ]
Orda, Ariel [2 ]
Ben-Itzhak, Yaniv [1 ]
机构
[1] VMware Res, Palo Alto, CA 94304 USA
[2] Technion, Haifa, Israel
基金
以色列科学基金会;
关键词
Resource efficient machine learning; Fast machine learning; Anomaly detection; Supervised learning; Decision-tree based ensemble methods; SMOTE;
D O I
10.1007/s10994-021-06047-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The capability to perform anomaly detection in a resource-constrained setting, such as an edge device or a loaded server, is of increasing need due to emerging on-premises computation constraints as well as security, privacy and profitability reasons. Yet, the increasing size of datasets often results in current anomaly detection methods being too resource consuming, and in particular decision-tree based ensemble classifiers. To address this need, we present RADE-a new resource-efficient anomaly detection framework that augments standard decision-tree based ensemble classifiers to perform well in a resource constrained setting. The key idea behind RADE is first to train a small model that is sufficient to correctly classify the majority of the queries. Then, using only subsets of the training data, train expert models for these fewer harder cases where the small model is at high risk of making a classification mistake. We implement RADE as a scikit-learn classifier. Our evaluation indicates that RADE offers competitive anomaly detection capabilities as compared to standard methods while significantly improving memory footprint by up to 12x, training-time by up to 20x, and classification time by up to 16x.
引用
收藏
页码:2835 / 2866
页数:32
相关论文
共 50 条
  • [1] RADE: resource-efficient supervised anomaly detection using decision tree-based ensemble methods
    Shay Vargaftik
    Isaac Keslassy
    Ariel Orda
    Yaniv Ben-Itzhak
    Machine Learning, 2021, 110 : 2835 - 2866
  • [2] Tree-based algorithms for weakly supervised anomaly detection
    Finke, Thorben
    Hein, Marie
    Kasieczka, Gregor
    Kraemer, Michael
    Mueck, Alexander
    Prangchaikul, Parada
    Quadfasel, Tobias
    Shih, David
    Sommerhalder, Manuel
    PHYSICAL REVIEW D, 2024, 109 (03)
  • [3] Resource-Efficient Parallel Tree-Based Join Architecture on FPGA
    Zhang, Huan
    Zhao, Bei
    Li, Wei-Jun
    Ma, Zhen-Guo
    Yu, Feng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2019, 66 (01) : 111 - 115
  • [4] Postprocessing of Ensemble Weather Forecast Using Decision Tree-Based Probabilistic Forecasting Methods
    Benacek, Patrik
    Farda, Ales
    Stepanek, Petr
    WEATHER AND FORECASTING, 2023, 38 (01) : 69 - 82
  • [5] Supervised learning with decision tree-based methods in computational and systems biology
    Geurts, Pierre
    Irrthum, Alexandre
    Wehenkel, Louis
    MOLECULAR BIOSYSTEMS, 2009, 5 (12) : 1593 - 1605
  • [6] Enhanced Tree-Based Anomaly Detection
    Karczmarek, Pawel
    Galka, Lukasz
    Dolecki, Michal
    Pedrycz, Witold
    Czerwinski, Dariusz
    Kiersztyn, Adam
    Stegierski, Rafal
    2022 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2022,
  • [7] Classification of repeated measurements data using tree-based ensemble methods
    Werner Adler
    Sergej Potapov
    Berthold Lausen
    Computational Statistics, 2011, 26
  • [8] Predicting musculoskeletal disorders risk using tree-based ensemble methods
    Paraponaris, A.
    Ba, A.
    Gallic, E.
    Liance, Q.
    Michel, Pierre
    EUROPEAN JOURNAL OF PUBLIC HEALTH, 2019, 29
  • [9] Classification of repeated measurements data using tree-based ensemble methods
    Adler, Werner
    Potapov, Sergej
    Lausen, Berthold
    COMPUTATIONAL STATISTICS, 2011, 26 (02) : 355 - 369
  • [10] Automatic feature subset selection for decision tree-based ensemble methods in the prediction of bioactivity
    Cao, Dong-Sheng
    Xu, Qing-Song
    Liang, Yi-Zeng
    Chen, Xian
    Li, Hong-Dong
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2010, 103 (02) : 129 - 136