A Comparative Evaluation of Unsupervised Anomaly Detection Algorithms for Multivariate Data

被引:544
|
作者
Goldstein, Markus [1 ]
Uchida, Seiichi [2 ]
机构
[1] Kyushu Univ, Ctr Coevolut Social Syst Innovat, Fukuoka 812, Japan
[2] Kyushu Univ, Dept Adv Informat Technol, Fukuoka 812, Japan
来源
PLOS ONE | 2016年 / 11卷 / 04期
基金
日本科学技术振兴机构;
关键词
NOVELTY DETECTION;
D O I
10.1371/journal.pone.0152173
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Anomaly detection is the process of identifying unexpected items or events in datasets, which differ from the norm. In contrast to standard classification tasks, anomaly detection is often applied on unlabeled data, taking only the internal structure of the dataset into account. This challenge is known as unsupervised anomaly detection and is addressed in many practical applications, for example in network intrusion detection, fraud detection as well as in the life science and medical domain. Dozens of algorithms have been proposed in this area, but unfortunately the research community still lacks a comparative universal evaluation as well as common publicly available datasets. These shortcomings are addressed in this study, where 19 different unsupervised anomaly detection algorithms are evaluated on 10 different datasets from multiple application domains. By publishing the source code and the datasets, this paper aims to be a new well-funded basis for unsupervised anomaly detection research. Additionally, this evaluation reveals the strengths and weaknesses of the different approaches for the first time. Besides the anomaly detection performance, computational effort, the impact of parameter settings as well as the global/local anomaly detection behavior is outlined. As a conclusion, we give an advise on algorithm selection for typical real-world tasks.
引用
收藏
页数:31
相关论文
共 50 条
  • [21] Unsupervised Online Anomaly Detection on Multivariate Sensing Time Series Data for Smart Manufacturing
    Hsieh, Ruei-Jie
    Chou, Jerry
    Ho, Chih-Hsiang
    2019 IEEE 12TH CONFERENCE ON SERVICE-ORIENTED COMPUTING AND APPLICATIONS (SOCA 2019), 2019, : 90 - 97
  • [22] A Deep Neural Network for Unsupervised Anomaly Detection and Diagnosis in Multivariate Time Series Data
    Zhang, Chuxu
    Song, Dongjin
    Chen, Yuncong
    Feng, Xinyang
    Lumezanu, Cristian
    Cheng, Wei
    Ni, Jingchao
    Zong, Bo
    Chen, Haifeng
    Chawla, Nitesh V.
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 1409 - 1416
  • [23] Unsupervised Anomaly Detection in Transactional Data
    Bouguessa, Mohamed
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 526 - 531
  • [24] Quantitative Comparison of Unsupervised Anomaly Detection Algorithms for Intrusion Detection
    Falcao, Filipe
    Zoppi, Tommaso
    Viera Silva, Caio Barbosa
    Santos, Anderson
    Fonseca, Baldoino
    Ceccarelli, Andrea
    Bondavalli, Andrea
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 318 - 327
  • [25] Multivariate Time Series Anomaly Detection: Fancy Algorithms and Flawed Evaluation Methodology
    Sehili, Mohamed El Amine
    Zhang, Zonghua
    PERFORMANCE EVALUATION AND BENCHMARKING, TPCTC 2023, 2024, 14247 : 1 - 17
  • [26] Comparative Evaluation of Anomaly Detection Techniques for Sequence Data
    Chandola, Varun
    Mithal, Varun
    Kumar, Vipin
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 743 - +
  • [27] A Survey on Unsupervised Anomaly Detection Algorithms for Industrial Images
    Cui, Yajie
    Liu, Zhaoxiang
    Lian, Shiguo
    IEEE ACCESS, 2023, 11 : 55297 - 55315
  • [28] DAEMON: Unsupervised Anomaly Detection and Interpretation for Multivariate Time Series
    Chen, Xuanhao
    Deng, Liwei
    Huang, Feiteng
    Zhang, Chengwei
    Zhang, Zongquan
    Zhao, Yan
    Zheng, Kai
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2225 - 2230
  • [29] Unsupervised Deep Variational Model for Multivariate Sensor Anomaly Detection
    Asres, Mulugeta Weldezgina
    Cummings, Grace
    Parygin, Pavel
    Khukhunaishvili, Aleko
    Toms, Maria
    Campbell, Alan
    Cooper, Seth, I
    Yu, David
    Dittmann, Jay
    Omlin, Christian W.
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021, : 364 - 371
  • [30] Multivariate Unsupervised Machine Learning for Anomaly Detection in Enterprise Applications
    Elsner, Daniel
    Khosroshahi, Pouya Aleatrati
    MacCormack, Alan D.
    Lagerstrom, Robert
    PROCEEDINGS OF THE 52ND ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2019, : 5827 - 5836