Fast anomaly detection with locality-sensitive hashing and hyperparameter autotuning

Cited by: 6
Authors
Meira, Jorge [1, 2]
Eiras-Franco, Carlos [1]
Bolon-Canedo, Veronica [1]
Marreiros, Goreti [2]
Alonso-Betanzos, Amparo [1]
Affiliations
[1] Univ A Coruna, CITIC, La Coruna 15071, Spain
[2] Inst Engn Polytech Porto ISEP IPP, GECAD, Porto, Portugal
Keywords
Anomaly detection; Unsupervised learning; AutoML; Scalability; Big data; OUTLIER DETECTION; NETWORK;
DOI
10.1016/j.ins.2022.06.035
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper presents LSHAD, an anomaly detection (AD) method based on Locality Sensitive Hashing (LSH), capable of dealing with large-scale datasets. The resulting algorithm is highly parallelizable and its implementation in Apache Spark further increases its ability to handle very large datasets. Moreover, the algorithm incorporates an automatic hyperparameter tuning mechanism so that users do not have to implement costly manual tuning. Our LSHAD method is novel as both hyperparameter automation and distributed properties are not usual in AD techniques. Our results for experiments with LSHAD across a variety of datasets point to state-of-the-art AD performance while handling much larger datasets than state-of-the-art alternatives. In addition, evaluation results for the tradeoff between AD performance and scalability show that our method offers significant advantages over competing methods. (C) 2022 Elsevier Inc. All rights reserved.
Pages: 1245-1264
Number of pages: 20
Related papers
50 in total
  • [31] Fast Low-Rank Matrix Approximation with Locality Sensitive Hashing for Quick Anomaly Detection
    Xie, Gaogang
    Xie, Kun
    Huang, Jun
    Wang, Xin
    Chen, Yuxiang
    Wen, Jigang
    IEEE INFOCOM 2017 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2017,
  • [32] Arrays of (locality-sensitive) Count Estimators (ACE): Anomaly Detection on the Edge
    Luo, Chen
    Shrivastava, Anshumali
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 1439 - 1448
  • [33] Kernelized Locality-Sensitive Hashing for Scalable Image Search
    Kulis, Brian
    Grauman, Kristen
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 2130 - 2137
  • [35] Analysis of Locality-Sensitive Hashing for Fast Critical Event Prediction on Physiological Time Series
    Kim, Yongwook Bryce
    O'Reilly, Una-May
    2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 783 - 787
  • [36] An incremental community detection method for social tagging systems using locality-sensitive hashing
    Wu, Zhenyu
    Zou, Ming
    NEURAL NETWORKS, 2014, 58 : 14 - 28
  • [37] On the Problem of $p_1^{-1}$ in Locality-Sensitive Hashing
    Ahle, Thomas Dybdahl
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2020, 2020, 12440 : 85 - 93
  • [38] An improved method of locality-sensitive hashing for scalable instance matching
    Aydar, Mehmet
    Ayvaz, Serkan
    Knowledge and Information Systems, 2019, 58 : 275 - 294
  • [39] Digital Watermarks for Videos Based on a Locality-Sensitive Hashing Algorithm
    Sun, Yajuan
    Srivastava, Gautam
    MOBILE NETWORKS & APPLICATIONS, 2023, 28 (05): : 1724 - 1737
  • [40] Frequent-Itemset Mining Using Locality-Sensitive Hashing
    Bera, Debajyoti
    Pratap, Rameshwar
    COMPUTING AND COMBINATORICS, COCOON 2016, 2016, 9797 : 143 - 155