A Comparative Evaluation of Unsupervised Anomaly Detection Algorithms for Multivariate Data

被引:544
|
作者
Goldstein, Markus [1 ]
Uchida, Seiichi [2 ]
机构
[1] Kyushu Univ, Ctr Coevolut Social Syst Innovat, Fukuoka 812, Japan
[2] Kyushu Univ, Dept Adv Informat Technol, Fukuoka 812, Japan
来源
PLOS ONE | 2016年 / 11卷 / 04期
基金
日本科学技术振兴机构;
关键词
NOVELTY DETECTION;
D O I
10.1371/journal.pone.0152173
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Anomaly detection is the process of identifying unexpected items or events in datasets, which differ from the norm. In contrast to standard classification tasks, anomaly detection is often applied on unlabeled data, taking only the internal structure of the dataset into account. This challenge is known as unsupervised anomaly detection and is addressed in many practical applications, for example in network intrusion detection, fraud detection as well as in the life science and medical domain. Dozens of algorithms have been proposed in this area, but unfortunately the research community still lacks a comparative universal evaluation as well as common publicly available datasets. These shortcomings are addressed in this study, where 19 different unsupervised anomaly detection algorithms are evaluated on 10 different datasets from multiple application domains. By publishing the source code and the datasets, this paper aims to be a new well-funded basis for unsupervised anomaly detection research. Additionally, this evaluation reveals the strengths and weaknesses of the different approaches for the first time. Besides the anomaly detection performance, computational effort, the impact of parameter settings as well as the global/local anomaly detection behavior is outlined. As a conclusion, we give an advise on algorithm selection for typical real-world tasks.
引用
收藏
页数:31
相关论文
共 50 条
  • [1] Comparative Analysis of Unsupervised Machine Learning Algorithms for Anomaly Detection in Network Data
    Oliveira, Junia Maisa
    Almeida, Jonatan
    Macedo, Daniel
    Nogueira, Jose Marcos
    2023 IEEE LATIN-AMERICAN CONFERENCE ON COMMUNICATIONS, LATINCOM, 2023,
  • [2] Unsupervised Anomaly Detection for Multivariate Incomplete Data using GAN-based Data Imputation: A Comparative Study
    Sarda, Kisan
    Yerudkar, Amol
    Del Vecchio, Carmen
    2023 31ST MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, MED, 2023, : 55 - 60
  • [3] A Comparative Evaluation of SOM-based Anomaly Detection Methods for Multivariate Data
    Guo, Bingjun
    Song, Lei
    Zheng, Taisheng
    Liang, Haoran
    Wang, Hongfei
    2019 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-QINGDAO), 2019,
  • [4] Study and Evaluation of Unsupervised Algorithms Used in Network Anomaly Detection
    Dromard, Juliette
    Owezarski, Philippe
    PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2019, VOL 2, 2020, 1070 : 397 - 416
  • [5] Anomaly detection in predictive maintenance: A new evaluation framework for temporal unsupervised anomaly detection algorithms
    Carrasco, Jacinto
    López, David
    Aguilera-Martos, Ignacio
    García-Gil, Diego
    Markova, Irina
    García-Barzana, Marta
    Arias-Rodil, Manuel
    Luengo, Julián
    Herrera, Francisco
    Neurocomputing, 2021, 462 : 440 - 452
  • [6] Anomaly detection in predictive maintenance: A new evaluation framework for temporal unsupervised anomaly detection algorithms
    Carrasco, Jacinto
    Lopez, David
    Aguilera-Martos, Ignacio
    Garcia-Gil, Diego
    Markova, Irina
    Garcia-Barzana, Marta
    Arias-Rodil, Manuel
    Luengo, Julian
    Herrera, Francisco
    NEUROCOMPUTING, 2021, 462 : 440 - 452
  • [7] Unsupervised Deep Anomaly Detection for Industrial Multivariate Time Series Data
    Liu, Wenqiang
    Yan, Li
    Ma, Ningning
    Wang, Gaozhou
    Ma, Xiaolong
    Liu, Peishun
    Tang, Ruichun
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [8] Ensemble Algorithms for Unsupervised Anomaly Detection
    Zhao, Zhiruo
    Mehrotra, Kishan G.
    Mohan, Chilukuri K.
    CURRENT APPROACHES IN APPLIED ARTIFICIAL INTELLIGENCE, 2015, 9101 : 514 - 525
  • [9] On Algorithms Selection for Unsupervised Anomaly Detection
    Zoppi, Tommaso
    Ceccarelli, Andrea
    Bondavalli, Andrea
    2018 IEEE 23RD PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING (PRDC), 2018, : 279 - 288
  • [10] Unsupervised Anomaly and Change Detection With Multivariate Gaussianization
    Padron-Hidalgo, Jose A.
    Laparra, Valero
    Camps-Valls, Gustau
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60