Attention-guided generator with dual discriminator GAN for real-time video anomaly detection

被引:9
|
作者
Singh, Rituraj [1 ]
Sethi, Anikeit [1 ]
Saini, Krishanu [1 ]
Saurav, Sumeet [2 ]
Tiwari, Aruna [1 ]
Singh, Sanjay [2 ]
机构
[1] Indian Inst Technol Indore, Indore, India
[2] CSIR CEERI, Adv Informat Technol Grp, Pilani, Rajasthan, India
关键词
Generative adversarial networks (GAN); Adversarial learning; One-class classification (OCC); Video anomaly detection; ABNORMAL EVENT DETECTION; DEEP; ROBUST;
D O I
10.1016/j.engappai.2023.107830
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting anomalies in videos presents a significant challenge in the field of video surveillance. The primary goal is identifying and detecting uncommon actions or events within a video sequence. The difficulty arises from the limited availability of video frames depicting anomalies and the ambiguous definition of anomaly. Based on extensive applications of Generative Adversarial Networks (GANs), which consist of a generator and a discriminator network, we propose an Attention -guided Generator with Dual Discriminator GAN (A2D-GAN) for real-time video anomaly detection (VAD). The generator network uses an encoder-decoder architecture with a multi -stage self -attention added to the encoder and multi -stage channel attention added to the decoder. The framework uses adversarial learning from noise and video frame reconstruction to enhance the generalization of the generator network. Also, of the dual discriminator in A2D-GAN, one discriminates between the reconstructed video frame and the real video frame, while the other discriminates between the reconstructed noise and the real noise. Exhaustive experiments and ablation studies on four benchmark video anomaly datasets, namely UCSD Peds, CUHK Avenue, ShanghaiTech, and Subway, demonstrate the effectiveness of the proposed A2D-GAN compared to other state-of-the-art methods. The proposed A2D-GAN model is robust and can detect anomalies in videos in real-time. The source code to replicate the results of the proposed A2D-GAN model is available at https://github.com/Rituraj-ksi/A2D-GAN.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Video Prediction and Anomaly Detection Algorithm Based On Dual Discriminator
    Fan, Sinuo
    Meng, Fanjie
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA 2020), 2020, : 123 - 127
  • [22] Attention-Guided Disentangled Feature Aggregation for Video Object Detection
    Muralidhara, Shishir
    Hashmi, Khurram Azeem
    Pagani, Alain
    Liwicki, Marcus
    Stricker, Didier
    Afzal, Muhammad Zeshan
    SENSORS, 2022, 22 (21)
  • [23] VALD-GAN: video anomaly detection using latent discriminator augmented GAN
    Singh, Rituraj
    Sethi, Anikeit
    Saini, Krishanu
    Saurav, Sumeet
    Tiwari, Aruna
    Singh, Sanjay
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 821 - 831
  • [24] VALD-GAN: video anomaly detection using latent discriminator augmented GAN
    Rituraj Singh
    Anikeit Sethi
    Krishanu Saini
    Sumeet Saurav
    Aruna Tiwari
    Sanjay Singh
    Signal, Image and Video Processing, 2024, 18 : 821 - 831
  • [25] Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network
    Shi, Pan
    Zhao, Zijian
    Hu, Sanyuan
    Chang, Faliang
    IEEE ACCESS, 2020, 8 : 228853 - 228862
  • [26] Lightweight attention-guided redundancy-reuse network for real-time semantic segmentation
    Hu, Xuegang
    Xu, Shuhan
    Jing, Liyuan
    IET IMAGE PROCESSING, 2023, 17 (09) : 2649 - 2658
  • [27] Real-time stereo matching with high accuracy via Spatial Attention-Guided Upsampling
    Wu, Zhong
    Zhu, Hong
    He, Lili
    Zhao, Qiang
    Shi, Jing
    Wu, Wenhuan
    APPLIED INTELLIGENCE, 2023, 53 (20) : 24253 - 24274
  • [28] Real-time stereo matching with high accuracy via Spatial Attention-Guided Upsampling
    Zhong Wu
    Hong Zhu
    Lili He
    Qiang Zhao
    Jing Shi
    Wenhuan Wu
    Applied Intelligence, 2023, 53 : 24253 - 24274
  • [29] Lightweight multi-scale attention-guided network for real-time semantic segmentation
    Hu, Xuegang
    Liu, Yuanjing
    IMAGE AND VISION COMPUTING, 2023, 139
  • [30] CSART: Channel and spatial attention-guided residual learning for real-time object tracking
    Zhang, Dawei
    Zheng, Zhonglong
    Li, Minglu
    Liu, Rixian
    NEUROCOMPUTING, 2021, 436 : 260 - 272