Fast anomaly detection with locality-sensitive hashing and hyperparameter autotuning

被引:6
|
作者
Meira, Jorge [1 ,2 ]
Eiras-Franco, Carlos [1 ]
Bolon-Canedo, Veronica [1 ]
Marreiros, Goreti [2 ]
Alonso-Betanzos, Amparo [1 ]
机构
[1] Univ A Coruna, CITIC, La Coruna 15071, Spain
[2] Inst Engn Polytech Porto ISEP IPP, GECAD, Porto, Portugal
关键词
Anomaly detection; Unsupervised learning; AutoML; Scalability; Big data; OUTLIER DETECTION; NETWORK;
D O I
10.1016/j.ins.2022.06.035
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents LSHAD, an anomaly detection (AD) method based on Locality Sensitive Hashing (LSH), capable of dealing with large-scale datasets. The resulting algorithm is highly parallelizable and its implementation in Apache Spark further increases its ability to handle very large datasets. Moreover, the algorithm incorporates an automatic hyperparameter tuning mechanism so that users do not have to implement costly manual tuning. Our LSHAD method is novel as both hyperparameter automation and distributed properties are not usual in AD techniques. Our results for experiments with LSHAD across a variety of datasets point to state-of-the-art AD performance while handling much larger datasets than state-of-the-art alternatives. In addition, evaluation results for the tradeoff between AD performance and scalability show that our method offers significant advantages over competing methods. (C) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:1245 / 1264
页数:20
相关论文
共 50 条
  • [41] Locality-Sensitive Hashing for Finding Nearest Neighbors in Probability Distributions
    Tang, Yi-Kun
    Mao, Xian-Ling
    Hao, Yi-Jing
    Xu, Cheng
    Huang, Heyan
    SOCIAL MEDIA PROCESSING, SMP 2017, 2017, 774 : 3 - 15
  • [42] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Liu, Renyang
    Zhao, Jun
    Chu, Xing
    Liang, Yu
    Zhou, Wei
    He, Jing
    SOFT COMPUTING, 2024, 28 (02) : 887 - 902
  • [43] A Scalable ECG Identification System Based on Locality-Sensitive Hashing
    Chu, Hui-Yu
    Lin, Tzu-Yun
    Lee, Song-Hong
    Chiu, Jui-Kun
    Nien, Cing-Ping
    Wu, Shun-Chi
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [44] Similar Pair Identification using Locality-Sensitive Hashing Technique
    Lee, Kyung Mi
    Lee, Keon Myung
    6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 2117 - 2119
  • [45] A Fast and Memory-Efficient Spectral Library Search Algorithm Using Locality-Sensitive Hashing
    Wang, Lei
    Liu, Kaiyuan
    Li, Sujun
    Tang, Haixu
    PROTEOMICS, 2020, 20 (21-22)
  • [46] CONSULT: accurate contamination removal using locality-sensitive hashing
    Rachtman, Eleonora
    Bafna, Vineet
    Mirarab, Siavash
    NAR GENOMICS AND BIOINFORMATICS, 2021, 3 (03)
  • [47] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Renyang Liu
    Jun Zhao
    Xing Chu
    Yu Liang
    Wei Zhou
    Jing He
    Soft Computing, 2024, 28 : 1041 - 1053
  • [48] A novel locality-sensitive hashing for large scale image retrieva
    Li, Junyi
    Li, Jianhua
    Ni, Bingbing
    Yan, Shuicheng
    Journal of Computational Information Systems, 2012, 8 (23): : 9611 - 9617
  • [49] Fast Distributed kNN Graph Construction Using Auto-tuned Locality-sensitive Hashing
    Eiras-Franco, Carlos
    Martinez-Rego, David
    Kanthan, Leslie
    Pineiro, Cesar
    Bahamonde, Antonio
    Guijarro-Berdinas, Bertha
    Alonso-Betanzos, Amparo
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 11 (06)
  • [50] Maintaining Academic Integrity in Programming: Locality-Sensitive Hashing and Recommendations
    Karnalim, Oscar
    EDUCATION SCIENCES, 2023, 13 (01):