Semisupervised anomaly detection of multivariate time series based on a variational autoencoder

被引:13
|
作者
Chen, Ningjiang [1 ,2 ,3 ]
Tu, Huan [1 ]
Duan, Xiaoyan [1 ]
Hu, Liangqing [1 ]
Guo, Chengxiang [4 ]
机构
[1] Guangxi Univ, Coll Comp & Elect Informat, Nanning 530004, Guangxi, Peoples R China
[2] Guangxi Coll & Univ Key Lab Parallel & Distribute, Nanning 530000, Guangxi, Peoples R China
[3] Guangxi Key Lab Multimedia Commun & Network Techn, Nanning 530000, Guangxi, Peoples R China
[4] Guangxi Univ Chinese Med, Nanning 530200, Guangxi, Peoples R China
关键词
Multivariate time series; Anomaly detection; Semisupervised learning; VAE; LSTM; Attention mechanism;
D O I
10.1007/s10489-022-03829-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a large-scale cloud environment, many key performance indicators (KPIs) of entities are monitored in real time. These multivariate time series consist of high-dimensional, high-noise, random and time-dependent data. As a common method implemented in artificial intelligence for IT operations (AIOps), time series anomaly detection has been widely studied and applied. However, the existing detection methods cannot fully consider the influence of multiple factors and cannot quickly and accurately detect anomalies in multivariate KPIs of entities. Concurrently, fine-grained root cause locations cannot be determined for detected anomalies and often require abundant normal data that are difficult to obtain for model training. To solve these problems, we propose a long short-term memory (LSTM)-based semisupervised variational autoencoder (VAE) anomaly detection strategy called LR-SemiVAE. First, LR-SemiVAE uses VAE to perform feature dimension reduction and reconstruction of multivariate time series data and judges whether the entity is abnormal by calculating the reconstruction probability score. Second, by introducing an LSTM network into the VAE encoder and decoder, the model can fully learn the time dependence of multivariate time series. Then, LR-SemiVAE predicts the data labels by introducing a classifier to reduce the dependence on the original labeled data during model training. Finally, by proposing a new evidence lower bound (ELBO) loss function calculation method, LR-SemiVAE pays attention to the normal pattern and ignores the abnormal pattern during training to reduce the time cost of removing random anomaly and noise data. However, due to the limitations of LSTM in learning the long-term dependence of time series data, based on LR-SemiVAE, we propose a transformer-based semisupervised VAE anomaly detection and location strategy called RT-SemiVAE for cluster systems with complex service dependencies. This method learns the long-term dependence of multivariate time series by introducing a parallel multihead attention mechanism transformer, while LSTM is used to capture short-term dependence, and the introduction of parallel computing also markedly reduces model training time. After RT-SemiVAE detects entity anomalies, it traces the root entities according to the obtained service dependence graph and locates the root causes at the indicator level. We verify the strategies by using public data sets and constructing a system prototype. Experimental results show that compared with existing baseline methods, the LR-SemiVAE and RT-SemiVAE strategies can detect anomalies more quickly and accurately and perform fine-grained and accurate localization of the root causes of anomalies.
引用
收藏
页码:6074 / 6098
页数:25
相关论文
共 50 条
  • [1] Semisupervised anomaly detection of multivariate time series based on a variational autoencoder
    Ningjiang Chen
    Huan Tu
    Xiaoyan Duan
    Liangqing Hu
    Chengxiang Guo
    [J]. Applied Intelligence, 2023, 53 : 6074 - 6098
  • [2] Robust Unsupervised Anomaly Detection With Variational Autoencoder in Multivariate Time Series Data
    Yokkampon, Umaporn
    Mowshowitz, Abbe
    Chumkamon, Sakmongkon
    Hayashi, Eiji
    [J]. IEEE ACCESS, 2022, 10 : 57835 - 57849
  • [3] Unsupervised Anomaly Detection in Multivariate Time Series through Transformer-based Variational Autoencoder
    Zhang, Hongwei
    Xia, Yuanqing
    Yan, Tijin
    Liu, Guiyang
    [J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 281 - 286
  • [4] Contrastive autoencoder for anomaly detection in multivariate time series
    Zhou, Hao
    Yu, Ke
    Zhang, Xuan
    Wu, Guanlin
    Yazidi, Anis
    [J]. INFORMATION SCIENCES, 2022, 610 : 266 - 280
  • [5] Multivariate time series anomaly detection with variational autoencoder and spatial-temporal graph network
    Guan, Siwei
    He, Zhiwei
    Ma, Shenhui
    Gao, Mingyu
    [J]. COMPUTERS & SECURITY, 2024, 142
  • [6] Improved Variational Autoencoder Anomaly Detection in Time Series Data
    Yokkampon, Umaporn
    Chumkamon, Sakmongkon
    Mowshowitz, Abbe
    Fujisawa, Ryusuke
    Hayashi, Eiji
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 82 - 87
  • [7] Variational transformer-based anomaly detection approach for multivariate time series
    Wang, Xixuan
    Pi, Dechang
    Zhang, Xiangyan
    Liu, Hao
    Guo, Chang
    [J]. MEASUREMENT, 2022, 191
  • [8] An adversarial contrastive autoencoder for robust multivariate time series anomaly detection
    Yu, Jiahao
    Gao, Xin
    Zhai, Feng
    Li, Baofeng
    Xue, Bing
    Fu, Shiyuan
    Chen, Lingli
    Meng, Zhihang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 245
  • [9] Federated Variational Learning for Anomaly Detection in Multivariate Time Series
    Zhang, Kai
    Jiang, Yushan
    Seversky, Lee
    Xu, Chengtao
    Liu, Dahai
    Song, Houbing
    [J]. 2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
  • [10] MST-VAE: Multi-Scale Temporal Variational Autoencoder for Anomaly Detection in Multivariate Time Series
    Pham, Tuan-Anh
    Lee, Jong-Hoon
    Park, Choong-Shik
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (19):