Transformer-enabled weakly supervised abnormal event detection in intelligent video surveillance systems

被引:0
|
作者
Paulraj, Shalmiya [1 ]
Vairavasundaram, Subramaniyaswamy [2 ]
机构
[1] SASTRA Deemed Univ, Sch Comp, Thanjavur 613401, India
[2] Vellore Inst Technol, Sch Comp Sci & Engn, Vellore 632014, India
关键词
Artificial intelligence; Abnormal event detection; Computer vision; Transformer models; Global self-attention; Intelligent video surveillance; Real-time monitoring;
D O I
10.1016/j.engappai.2024.109496
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video Anomaly Detection (VAD) for weakly supervised data operates with limited video-level annotations. It also holds the practical significance to play a pivotal role in surveillance and security applications like public safety, patient monitoring, autonomous vehicles, etc. Moreover, VAD extends its utility to various industrial settings, where it is instrumental in safeguarding workers' safety, enabling real-time production quality monitoring, and predictive maintenance. These diverse applications highlight the versatility of VAD and its potential to transform processes across various industries, making it an essential tool along with traditional surveillance applications. The majority of the existing studies have been focused on mitigating critical aspects of VAD, such as reducing false alarm rates and misdetection. These challenges can be effectively addressed by capturing the intricate spatiotemporal pattern within video data. Therefore, the proposed work named Swin Transformer-based Hybrid Temporal Adaptive Module (ST-HTAM) Abnormal Event Detection introduces an intuitive temporal module along with leveraging the strengths of the Swin (Shifted window-based) Transformers for spatial analysis. The novel aspect of this work lies in the hybridization of global self-attention and Convolutional-Long Short Term Memory (C-LSTM) Networks are renowned for capturing both global and local temporal dependencies. By extracting these spatial and temporal components, the proposed method, ST-HTAM, offers a comprehensive understanding of anomalous events. Altogether, it enhances the accuracy and robustness of Weakly Supervised VAD (WS-VAD). Finally, an anomaly scoring mechanism is employed in the classification step to facilitate effective anomaly detection from test video data. The proposed system is tailored to operate in real-time and highlights the dual focus on sophisticated Artificial Intelligence (AI) techniques and their impactful use cases across diverse domains. Comprehensive experiments are conducted on benchmark datasets that clearly show the substantial superiority of the ST-HTAM over state-of-the-art approaches. Code is available at https://github. com/Shalmiyapaulraj78/STHTAM-VAD.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Abnormal event detection for video surveillance using deep one-class learning
    Sun, Jiayu
    Shao, Jie
    He, Chengkun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (03) : 3633 - 3647
  • [42] Abnormal event detection for video surveillance using deep one-class learning
    Jiayu Sun
    Jie Shao
    Chengkun He
    Multimedia Tools and Applications, 2019, 78 : 3633 - 3647
  • [43] Unsupervised learning approach for abnormal event detection in surveillance video by revealing infrequent patterns
    Sandhan, Tushar
    Srivastava, Tushar
    Sethi, Amit
    Choi, Jin Young
    PROCEEDINGS OF 2013 28TH INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ 2013), 2013, : 494 - 499
  • [44] A hybrid generative-discriminative model for abnormal event detection in surveillance video scenes
    Kumar P.M.A.
    Kavitha D.
    Kumar S.A.
    International Journal of Information and Computer Security, 2020, 12 (2-3) : 253 - 268
  • [45] Group Event Detection for Video Surveillance
    Lin, Weiyao
    Sun, Ming-Ting
    Poovendran, Radha
    Zhang, Zhengyou
    ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 2830 - +
  • [46] Self-Training Multi-Sequence Learning with Transformer for Weakly Supervised Video Anomaly Detection
    Li, Shuo
    Liu, Fang
    Jiao, Licheng
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1395 - 1403
  • [47] A novel method for moving object detection in intelligent video surveillance systems
    Zhao, Mingying
    Zhao, Jun
    Zhao, Shuguang
    Wang, Yuan
    2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 1797 - 1800
  • [48] Weakly supervised learning of a classifier for unusual event detection
    Jaeger, Mark
    Knoll, Christian
    Hamprecht, Fred A.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2008, 17 (09) : 1700 - 1708
  • [49] AFFINITY MIXUP FOR WEAKLY SUPERVISED SOUND EVENT DETECTION
    Izadi, Mohammad Rasool
    Stevenson, Robert
    Kloepper, Laura
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [50] DURATION ROBUST WEAKLY SUPERVISED SOUND EVENT DETECTION
    Dinkel, Heinrich
    Yu, Kai
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 311 - 315