Sample-Correlation-Aware Unsupervised Deep Anomaly Detection Model

被引:0
|
作者
Xi L. [1 ]
Wang R.-D. [1 ]
Fan H.-Y. [1 ]
Zhang F.-B. [1 ]
机构
[1] School of Computer Science and Technology, Harbin University of Science and Technology, Harbin
来源
关键词
Anomaly detection; Data correlation; Gaussian mixture model; Graph neural network;
D O I
10.11897/SP.J.1016.2021.02317
中图分类号
学科分类号
摘要
The goal of anomaly detection is to identify abnormal patterns within normal patterns. Anomaly detection is applied in different forms in different application scenarios, such as network security, medical image and video monitoring. How to make full use of all kinds of characteristic information of data to identify the anomaly is one of the hot spots of anomaly detection. Many intelligent algorithms and models in data mining, machine learning and deep learning have been used to training anomaly detection rules to improve their detection performances. The training methods of anomaly detection rules are divided into three kinds: supervised, semi-supervised and unsupervised methods, and the third kind is the most popular. Formally, unsupervised deep anomaly detection can be viewed as density estimation from the data distribution. At present, deep anomaly detection models have achieved remarkable results in different application scenarios. But these methods are mainly based on the original features of data for training, and ignore the complex correlation among data samples: there are usually some kinds of correlation among normal samples that abnormal samples do not have. This makes these methods not ideal for anomaly analysis of complex data with characteristics of large data size, high dimension, unbalanced abnormal proportion, etc. In view of this, in this paper, a sample-correlation-aware deep learning model is proposed and used for anomaly detection, named sample-correlation-aware unsupervised deep anomaly detection model(SCA-UDLM): First of all, through in-depth analysis of the original sample features and the correlation features among samples, the model uses the K-nearest neighbors algorithm to search for similar samples to extract the correlation features among samples and store them in an undigraph structure, where nodes represent the samples and the edges represent the correlation between the two samples. Secondly, the original features and correlation features are fused based on the dual autoencoder composed of feature encoder and graph encoder, and generated high-quality data embedding in the low-dimensional feature space. Thirdly, the decoder decodes the low-dimensional embedding into reconstruction samples with original dimensional, and calculates the reconstruction error and features. Finally, an estimation network based on Gaussian mixture model is designed to estimate the probability density of samples based on the input of the reconstruction features and the low-dimensional embedding which are fused by the additive fusion method, and judge whether these samples are abnormal or not based on the given judgment threshold. A large number of experiments and analyses were made on this model and relevant representative machine learning methods and up-to-date deep learning methods, such as, one class support vector machines (OC-SVM), isolation forests(IF), deep structured energy based models(DSEBM), deep autoencoding GMM(DAGMM), Anomaly generative adversarial network(AnoGAN), adversarially learned anomaly detection(ALAD). The experimental results show that the detection performances of this model are improved by about 2% compared with other related methods, and its experimental results with different parameters, modules and noises are more stable than other methods, which prove the validity of the correlation module. The visualization experimental results can also highlight the advantages of this model in data feature extraction and full utilization. © 2021, Science Press. All right reserved.
引用
收藏
页码:2317 / 2331
页数:14
相关论文
共 43 条
  • [1] Murali S, Jamalipour A., A lightweight intrusion detection for Sybil attack under mobile RPL in the Internet of things, IEEE Internet of Things Journal, 7, 1, pp. 379-388, (2020)
  • [2] Zhao C H, Yao X F., Progressive line processing of global and local real-time anomaly detection in hyperspectral images, Journal of Real-Time Image Processing, 16, 6, pp. 2289-2303, (2019)
  • [3] Nawaratne R, Alahakoon D, De Silva D, Et al., Spatiotemporal anomaly detection using deep learning for real-time video surveillance, IEEE Transactions on Industrial Informatics, 16, 1, pp. 393-402, (2020)
  • [4] Fernandes G, Rodrigues J J P C, Carvalho L F, Et al., A comprehensive survey on network anomaly detection, Telecom munication Systems, 70, 3, pp. 447-489, (2019)
  • [5] Yuan F N, Zhang L, Shi J T, Et al., Theories and applications of auto-encoder neural networks: a literature survey, Chinese Journal of Computers, 42, pp. 203-230, (2019)
  • [6] Fu X, Shen Y T, Fu L H, Et al., An optimized sparse auto-encoder network based on feature clustering, Acta Electronica Sinica, 46, pp. 1041-1046, (2018)
  • [7] Zong B, Song Q, Min M R, Et al., Deep autoencoding Gaussian mixture model for unsupervised anomaly detection, Proceedings of the 6th International Conference on Learning Representations, pp. 1-19, (2018)
  • [8] Amarasinghe K, Kenney K, Manic M., Toward explainable deep neural network based anomaly detection, Proceedings of the 11th International Conference on Human System Interaction, pp. 311-317, (2018)
  • [9] Tran C P, Tran D K., Anomaly detection in POSTFIX mail log using principal component analysis, Proceedings of the 10th International Conference on Knowledge and Systems Engineering, pp. 107-112, (2018)
  • [10] Wang M X, Zhou HC, Chen J., A moving window principal components analysis based anomaly detection and mitigation approach in SDN network, KSII Transactions on Internet and Information Systems, 12, 8, pp. 3946-3965, (2018)