Fast anomaly detection with locality-sensitive hashing and hyperparameter autotuning

被引:6
|
作者
Meira, Jorge [1 ,2 ]
Eiras-Franco, Carlos [1 ]
Bolon-Canedo, Veronica [1 ]
Marreiros, Goreti [2 ]
Alonso-Betanzos, Amparo [1 ]
机构
[1] Univ A Coruna, CITIC, La Coruna 15071, Spain
[2] Inst Engn Polytech Porto ISEP IPP, GECAD, Porto, Portugal
关键词
Anomaly detection; Unsupervised learning; AutoML; Scalability; Big data; OUTLIER DETECTION; NETWORK;
D O I
10.1016/j.ins.2022.06.035
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents LSHAD, an anomaly detection (AD) method based on Locality Sensitive Hashing (LSH), capable of dealing with large-scale datasets. The resulting algorithm is highly parallelizable and its implementation in Apache Spark further increases its ability to handle very large datasets. Moreover, the algorithm incorporates an automatic hyperparameter tuning mechanism so that users do not have to implement costly manual tuning. Our LSHAD method is novel as both hyperparameter automation and distributed properties are not usual in AD techniques. Our results for experiments with LSHAD across a variety of datasets point to state-of-the-art AD performance while handling much larger datasets than state-of-the-art alternatives. In addition, evaluation results for the tradeoff between AD performance and scalability show that our method offers significant advantages over competing methods. (C) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:1245 / 1264
页数:20
相关论文
共 50 条
  • [21] Fast distributed video deduplication via locality-sensitive hashing with similarity ranking
    Li, Yeguang
    Hu, Liang
    Xia, Ke
    Luo, Jie
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2019, 2019 (1)
  • [22] Fast distributed video deduplication via locality-sensitive hashing with similarity ranking
    Yeguang Li
    Liang Hu
    Ke Xia
    Jie Luo
    EURASIP Journal on Image and Video Processing, 2019
  • [23] Fast alignment filtering of nanopore sequencing reads using locality-sensitive hashing
    Wang, Jeremy R.
    Jones, Corbin D.
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 127 - 130
  • [24] Efficient Outlier Detection in Hyperedge Streams Using MinHash and Locality-Sensitive Hashing
    Ranshous, Stephen
    Chaudhary, Mandar
    Samatova, Nagiza F.
    COMPLEX NETWORKS & THEIR APPLICATIONS VI, 2018, 689 : 105 - 116
  • [25] A Locality-Sensitive Hashing-Based Jamming Detection System for IoT Networks
    Ganeshkumar, P.
    Albalawi, Talal
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 5943 - 5959
  • [26] Fast Duplicate Detection Using Locality Sensitive Hashing
    Rong, C. T.
    Feng, L. J.
    INTERNATIONAL CONFERENCE ON ADVANCED EDUCATIONAL TECHNOLOGY AND INFORMATION ENGINEERING (AETIE 2015), 2015, : 580 - 588
  • [27] Video anomaly detection based on locality sensitive hashing filters
    Zhang, Ying
    Lu, Huchuan
    Zhang, Lihe
    Ruan, Xiang
    Sakai, Shun
    PATTERN RECOGNITION, 2016, 59 : 302 - 311
  • [28] Accurate and Fast Asymmetric Locality-Sensitive Hashing Scheme for Maximum Inner Product Search
    Huang, Qiang
    Ma, Guihong
    Feng, Jianlin
    Fang, Qiong
    Tung, Anthony K. H.
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1561 - 1570
  • [29] Locality-Sensitive Hashing for Chi2 Distance
    Gorisse, David
    Cord, Matthieu
    Precioso, Frederic
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) : 402 - 409
  • [30] Locality-Sensitive Hashing Without False Negatives for lp
    Pacuk, Andrzej
    Sankowski, Piotr
    Wegrzycki, Karol
    Wygocki, Piotr
    COMPUTING AND COMBINATORICS, COCOON 2016, 2016, 9797 : 105 - 118