An Iterative Method for Unsupervised Robust Anomaly Detection Under Data Contamination

被引:0
|
作者
Kim, Minkyung [1 ]
Yu, Jongmin [2 ]
Kim, Junsik [3 ]
Oh, Tae-Hyun [4 ,5 ,6 ]
Choi, Jun Kyun [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon 34141, South Korea
[2] Kings Coll London, Dept Engn, London WC2R 2LS, England
[3] Harvard Univ, Sch Engn & Appl Sci, Cambridge, MA 02138 USA
[4] POSTECH, Dept Elect Engn, Pohang 37673, South Korea
[5] POSTECH, Grad Sch AI GSAI, Pohang 37673, South Korea
[6] Yonsei Univ, Inst Convergence Res & Educ Adv Technol, Seoul 03722, South Korea
基金
新加坡国家研究基金会;
关键词
Anomaly detection; Contamination; Training; Data models; Pollution measurement; Iterative methods; Neural networks; contaminated dataset; iterative learning; normality; unsupervised learning; NETWORK; SUPPORT;
D O I
10.1109/TNNLS.2023.3267028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most deep anomaly detection models are based on learning normality from datasets due to the difficulty of defining abnormality by its diverse and inconsistent nature. Therefore, it has been a common practice to learn normality under the assumption that anomalous data are absent in a training dataset, which we call normality assumption. However, in practice, the normality assumption is often violated due to the nature of real data distributions that includes anomalous tails, i.e., a contaminated dataset. Thereby, the gap between the assumption and actual training data affects detrimentally in learning of an anomaly detection model. In this work, we propose a learning framework to reduce this gap and achieve better normality representation. Our key idea is to identify sample-wise normality and utilize it as an importance weight, which is updated iteratively during the training. Our framework is designed to be model-agnostic and hyperparameter insensitive so that it applies to a wide range of existing methods without careful parameter tuning. We apply our framework to three different representative approaches of deep anomaly detection that are classified into one-class classification-, probabilistic model-, and reconstruction-based approaches. In addition, we address the importance of a termination condition for iterative methods and propose a termination criterion inspired by the anomaly detection objective. We validate that our framework improves the robustness of the anomaly detection models under different levels of contamination ratios on five anomaly detection benchmark datasets and two image datasets. On various contaminated datasets, our framework improves the performance of three representative anomaly detection methods, measured by area under the ROC curve.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [11] Coping with Training Contamination in Unsupervised Distributional Anomaly Detection
    Borges, Nash
    Meyer, Gerard G. L.
    2009 43RD ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS, VOLS 1 AND 2, 2009, : 264 - 269
  • [12] Robust Unsupervised Anomaly Detection With Variational Autoencoder in Multivariate Time Series Data
    Yokkampon, Umaporn
    Mowshowitz, Abbe
    Chumkamon, Sakmongkon
    Hayashi, Eiji
    IEEE ACCESS, 2022, 10 : 57835 - 57849
  • [13] Unsupervised Anomaly Detection in Transactional Data
    Bouguessa, Mohamed
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 526 - 531
  • [14] Hybrid robust convolutional autoencoder for unsupervised anomaly detection of machine tools under noises
    Yan, Shen
    Shao, Haidong
    Xiao, Yiming
    Liu, Bin
    Wan, Jiafu
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2023, 79
  • [15] Unsupervised Anomaly Detection in Sequential Process Data
    Bulut, Okan
    Gorgun, Guher
    He, Surina
    ZEITSCHRIFT FUR PSYCHOLOGIE-JOURNAL OF PSYCHOLOGY, 2024, 232 (02): : 74 - 94
  • [16] Unsupervised Anomaly Detection in Data Quality Control
    Poon, Lex
    Farshidi, Siamak
    Li, Na
    Zhao, Zhiming
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 2327 - 2336
  • [17] Unsupervised Anomaly Detection on Temporal Multiway Data
    Duc Nguyen
    Phuoc Nguyen
    Kien Do
    Rana, Santu
    Gupta, Sunil
    Truyen Tran
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1059 - 1066
  • [18] SoftPatch: Unsupervised Anomaly Detection with Noisy Data
    Jiang, Xi
    Liu, Jianlin
    Wang, Jinbao
    Nie, Qian
    Wu, Kai
    Liu, Yong
    Wang, Chengjie
    Zheng, Feng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [19] Unsupervised Nonparametric Anomaly Detection: A Kernel Method
    Zou, Shaofeng
    Liang, Yingbin
    Poor, H. Vincent
    Shi, Xinghua
    2014 52ND ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2014, : 836 - 841
  • [20] A simple method for unsupervised anomaly detection: An application to Web time series data
    Yoshihara, Keisuke
    Takahashi, Kei
    PLOS ONE, 2022, 17 (01):