Weakly Supervised Video Anomaly Detection via Transformer-Enabled Temporal Relation Learning

被引:22
|
作者
Zhang, Dasheng [1 ]
Huang, Chao [2 ]
Liu, Chengliang [2 ]
Xu, Yong [2 ,3 ]
机构
[1] Chongqing Univ, Sch Artificial Intelligence, Chongqing 401135, Peoples R China
[2] Harbin Inst Technol, Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen 518055, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518055, Peoples R China
基金
国家重点研发计划;
关键词
Feature extraction; Transformers; Task analysis; Anomaly detection; Training; Surveillance; Training data; Deep learning; video anomaly detection; vision transformer; weakly-supervised learning;
D O I
10.1109/LSP.2022.3175092
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Weakly supervised video anomaly detection is a challenging problem due to the lack of frame-level labels in training videos. Most previous works typically tackle this task with the multiple instance learning paradigm, which divides a video into multiple snippets and trains a snippet classifier to distinguish anomalies from normal snippets via video-level supervision information. Although existing approaches achieve remarkable progresses, these solutions are still limited in the insufficient representations. In this paper, we propose a novel weakly supervised temporal relation learning framework for anomaly detection, which efficiently explores the temporal relation between snippets and enhances the discriminative powers of features using only video-level labelled videos. To this end, we design a transformer-enabled feature encoder to convert the input task-agnostic features into discriminative task-specific features by mining the semantic correlation and position relation between video snippets. As a result, our model can make a more accurate anomaly detection for current video snippet based on the learned discriminative features. Experimental results indicate that the proposed method is superior to existing state-of-the-art approaches, which demonstrates the effectiveness of our model.
引用
收藏
页码:1197 / 1201
页数:5
相关论文
共 50 条
  • [41] CLIP-Driven Multi-Scale Instance Learning for Weakly Supervised Video Anomaly Detection
    Qian, Zhangbin
    Tan, Jiawei
    Ou, Zhilong
    Wang, Hongxing
    Proceedings - IEEE International Conference on Multimedia and Expo, 2024,
  • [42] End-to-end learning for weakly supervised video anomaly detection using Absorbing Markov Chain
    Park, Jaeyoo
    Kim, Junha
    Han, Bohyung
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 236
  • [43] Fusing crops representation into snippet via mutual learning for weakly supervised surveillance anomaly detection
    Zhang, Bohua
    Xue, Jianru
    IET COMPUTER VISION, 2024,
  • [44] Anomaly Detection in Video via Self-Supervised and Multi-Task Learning
    Georgescu, Mariana-Iuliana
    Barbalau, Antonio
    Ionescu, Radu Tudor
    Khan, Fahad Shahbaz
    Popescu, Marius
    Shah, Mubarak
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12737 - 12747
  • [45] Feature Differentiation Reconstruction Network for Weakly-Supervised Video Anomaly Detection
    Gong, Yiling
    Luo, Sihui
    Wang, Chong
    Zheng, Yujie
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1462 - 1466
  • [46] Dual Memory Units with Uncertainty Regulation for Weakly Supervised Video Anomaly Detection
    Zhou, Hang
    Yu, Junqing
    Yang, Wei
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3769 - 3777
  • [47] Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection
    Zhang, Chen
    Li, Guorong
    Qi, Yuankai
    Wang, Shuhui
    Qing, Laiyun
    Huang, Qingming
    Yang, Ming-Hsuan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 16271 - 16280
  • [48] Scale-Aware Spatio-Temporal Relation Learning for Video Anomaly Detection
    Li, Guoqiu
    Cai, Guanxiong
    Zeng, Xingyu
    Zhao, Rui
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 333 - 350
  • [49] DAM : Dissimilarity Attention Module for Weakly-supervised Video Anomaly Detection
    Majhi, Snehashis
    Das, Srijan
    Bremond, Francois
    2021 17TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2021), 2021,
  • [50] Cross-Modal Attention Mechanism for Weakly Supervised Video Anomaly Detection
    Sun, Wenwen
    Cao, Lin
    Guo, Yanan
    Du, Kangning
    BIOMETRIC RECOGNITION, CCBR 2023, 2023, 14463 : 437 - 446