Deep Multi-view Representation Learning for Video Anomaly Detection Using Spatiotemporal Autoencoders

Cited by: 20
Authors
Deepak, K. [1]
Srivathsan, G. [1]
Roshan, S. [1]
Chandrakala, S. [1]
Affiliations
[1] SASTRA Univ, Sch Comp, Intelligent Syst Grp, Thanjavur 613401, India
Keywords
Video anomaly detection; 3D spatiotemporal autoencoder; Multi-view representation learning; Spatiotemporal autocorrelation of gradients (STACOG); One-class SVM; Abnormal event detection
DOI
10.1007/s00034-020-01522-7
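Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809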
Abstract
Visual perception is a transformative technology that can recognize patterns from environments through visual inputs. Automatic surveillance of human activities has gained significant importance in both public and private spaces. It is often difficult to understand the complex dynamics of events in real-time scenarios due to camera movements, cluttered backgrounds, and occlusion. Existing anomaly detection systems are not efficient because of high intra-class variations and inter-class similarities among activities. Hence, there is a demand to explore different kinds of information extracted from surveillance videos to improve overall performance. This can be achieved by learning features from multiple forms (views) of the given raw input data. We propose two novel methods based on the multi-view representation learning framework. The first approach is a hybrid multi-view representation learning method that combines deep features extracted from a 3D spatiotemporal autoencoder (3D-STAE) with robust handcrafted features based on spatiotemporal autocorrelation of gradients (STACOG). The second approach is a deep multi-view representation learning method that combines deep features extracted from two-stream STAEs to detect anomalies. Results on three standard benchmark datasets, namely Avenue, Live Videos, and BEHAVE, show that the proposed multi-view representations modeled with a one-class SVM perform significantly better than most of the recent state-of-the-art methods.
Pages: 1333-1349
Page count: 17
Related papers
50 in total
  • [1] Deep Multi-view Representation Learning for Video Anomaly Detection Using Spatiotemporal Autoencoders
    K. Deepak
    G. Srivathsan
    S. Roshan
    S. Chandrakala
    [J]. Circuits, Systems, and Signal Processing, 2021, 40 : 1333 - 1349
  • [2] Video semantic segmentation using deep multi-view representation learning
    Sellami, Akrem
    Tabbone, Salvatore
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8133 - 8139
  • [3] Anomaly detection in genomic catalogues using unsupervised multi-view autoencoders
    Ferre, Quentin
    Cheneby, Jeanne
    Puthier, Denis
    Capponi, Cecile
    Ballester, Benoit
    [J]. BMC BIOINFORMATICS, 2021, 22 (01)
  • [4] Anomaly detection in genomic catalogues using unsupervised multi-view autoencoders
    Quentin Ferré
    Jeanne Chèneby
    Denis Puthier
    Cécile Capponi
    Benoît Ballester
    [J]. BMC Bioinformatics, 22
  • [5] Spatiotemporal Representation Learning for Video Anomaly Detection
    Li, Zhaoyan
    Li, Yaoshun
    Gao, Zhisheng
    [J]. IEEE ACCESS, 2020, 8 : 25531 - 25542
  • [6] On Deep Multi-View Representation Learning
    Wang, Weiran
    Arora, Raman
    Livescu, Karen
    Bilmes, Jeff
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1083 - 1092
  • [7] DEEP MULTI-VIEW ROBUST REPRESENTATION LEARNING
    Jiao, Zhenyu
    Xu, Chao
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2851 - 2855
  • [8] Hierarchical graph augmented stacked autoencoders for multi-view representation learning
    Gou, Jianping
    Xie, Nannan
    Liu, Jinhua
    Yu, Baosheng
    Ou, Weihua
    Yi, Zhang
    Chen, Wu
    [J]. INFORMATION FUSION, 2024, 102
  • [9] Multi-View Representation Learning With Deep Gaussian Processes
    Sun, Shiliang
    Dong, Wenbo
    Liu, Qiuyang
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (12) : 4453 - 4468
  • [10] Deep multi-view representation learning for social images
    Huang, Feiran
    Zhang, Xiaoming
    Zhao, Zhonghua
    Li, Zhoujun
    He, Yueying
    [J]. APPLIED SOFT COMPUTING, 2018, 73 : 106 - 118