Unsupervised Anomaly Detection for Multivariate Incomplete Data using GAN-based Data Imputation: A Comparative Study

被引:0
|
作者
Sarda, Kisan [1 ]
Yerudkar, Amol [2 ]
Del Vecchio, Carmen [1 ]
机构
[1] Univ Sannio, Dept Engn, I-82100 Benevento, Italy
[2] Zhejiang Normal Univ, Sch Math Sci, Jinhua 321004, Zhejiang, Peoples R China
关键词
D O I
10.1109/MED59994.2023.10185791
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the increasing interconnectivity of cyber-physical systems (CPSs) in various fields, such as manufacturing plants, power plants, and smart networked systems, large amounts of multivariate data are generated through sensors and actuators, also other data sources such as measurements and images. This paper focuses on the anomaly detection (AD) problem, also known as fault detection or outlier detection, depending on the type of dataset, which involves identifying anomalous values in the dataset using analytical methods. However, datasets often contain missing values, which can lead to incorrect outcomes and affect the availability of anomalous samples that are fewer in amount, making incomplete datasets. Therefore, a generalized AD method is proposed for incomplete datasets, which involves two steps: data imputation (DI) to obtain complete datasets using GAN and later AD for the complete datasets. While statistical-based imputation methods are commonly used, they do not consider data distribution for datasets with anomalous samples. The capabilities of GAN-based DI are tested under different hyperparameter settings and percentages of missing values. The AD problem is then addressed using seven unsupervised anomaly detection methods on six different datasets, including a real dataset from a steel manufacturing plant in Italy. Each dataset is analyzed to determine which DI and AD method combination performs the best. The results show that GAN-imputed data provides the best DI performance, while the reweighted minimum covariance determinant (RMCD) method offers the overall best AD results combined with GAN.
引用
收藏
页码:55 / 60
页数:6
相关论文
共 50 条
  • [1] A Comparative Evaluation of Unsupervised Anomaly Detection Algorithms for Multivariate Data
    Goldstein, Markus
    Uchida, Seiichi
    [J]. PLOS ONE, 2016, 11 (04):
  • [2] GAN-based Anomaly Detection and Localization of Multivariate Time Series Data for Power Plant
    Choi, Yeji
    Lim, Hyunki
    Choi, Heeseung
    Kim, Ig-Jae
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 71 - 74
  • [3] DGTAD: decomposition GAN-based transformer for anomaly detection in multivariate time series data
    Chen, Zixin
    Yu, Jiong
    Tan, Qiyin
    Li, Shu
    Du, XuSheng
    [J]. Applied Intelligence, 2024, 54 (24) : 13038 - 13056
  • [4] A Comparative Study of Unsupervised Anomaly Detection Techniques Using Honeypot Data
    Song, Jungsuk
    Takakura, Hiroki
    Okabe, Yasuo
    Inoue, Daisuke
    Eto, Masashi
    Nakao, Koji
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09) : 2544 - 2554
  • [5] Studies on the GAN-Based Anomaly Detection Methods for the Time Series Data
    Lee, Chang-Ki
    Cheon, Yu-Jeong
    Hwang, Wook-Yeon
    [J]. IEEE ACCESS, 2021, 9 : 73201 - 73215
  • [6] Efficient GAN-Based Unsupervised Anomaly Sound Detection for Refrigeration Units
    Hatanaka, Shouichi
    Nishi, Hiroaki
    [J]. PROCEEDINGS OF 2021 IEEE 30TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2021,
  • [7] GAN-Based Anomaly Detection for Multivariate Time Series Using Polluted Training Set
    Du, Bowen
    Sun, Xuanxuan
    Ye, Junchen
    Cheng, Ke
    Wang, Jingyuan
    Sun, Leilei
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (12) : 12208 - 12219
  • [8] GAN-Based Data Augmentation Strategy for Sensor Anomaly Detection in Industrial Robots
    Lu, Haodong
    Du, Miao
    Qian, Kai
    He, Xiaoming
    Wang, Kun
    [J]. IEEE SENSORS JOURNAL, 2022, 22 (18) : 17464 - 17474
  • [9] DOPING: Generative Data Augmentation for Unsupervised Anomaly Detection with GAN
    Lim, Swee Kiat
    Loo, Yi
    Ngoc-Trung Tran
    Ngai-Man Cheung
    Roig, Gemma
    Elovici, Yuval
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1122 - 1127
  • [10] MIM-GAN-based Anomaly Detection for Multivariate Time Series Data
    Lu, Shan
    Dong, Zhicheng
    Cai, Donghong
    Fang, Fang
    Zhao, Dongcai
    [J]. 2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,