Spatial-temporal graph attention network for video anomaly detection

被引:9
|
作者
Chen, Haoyang [1 ]
Mei, Xue [1 ]
Ma, Zhiyuan [1 ]
Wu, Xinhong [1 ]
Wei, Yachuan [1 ]
机构
[1] Nanjing Tech Univ, Nanjing 211816, Peoples R China
关键词
Video anomaly detection; Multiple instance learning; Graph convolutional network; Multi-head graph attention; CONVOLUTIONAL NETWORKS; LOCALIZATION;
D O I
10.1016/j.imavis.2023.104629
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video anomaly detection, which is weakly supervised by video-level annotations, is a frequent yet challenging task in computer vision owing to its unexpectedness, equivocality, rarity, irregularity, and diversity. Although previous seminal works successfully leveraged graph convolutions to assist in the detection of anomalies, they failed to subsequently explore the potential of this approach. In this study, we developed a spatial-temporal graph attention network (STGA), which, to the best of our knowledge, is the first effort to combine graph convolutions with a multi-head graph attention mechanism for video anomaly detection. Specifically, a spatial correlation graph and temporal dependence graph were devised to learn distinguishable representations with the complementation of graph attention techniques. Furthermore, STGA was incorporated within the multiple instance learning (MIL) pipeline, and optimized via top-k MIL ranking loss after a sparse continuous sampling strategy was put into effect. We conducted experiments using four multi-scale datasets to validate the efficacy of our model. Quantitatively, our method performs equivalently to the current best result on ShanghaiTech with a frame-level area under the curve (AUC) of 97.21%, obtains the second-best result on UCSD-Ped2 with a frame-level AUC of 97.4%, gains a frame-level AUC of 80.28% on UCF-Crime, and achieves a new state-of-the-art performance on TAD with a frame-level AUC of 91.42%. Additionally, our false alarm rate results outperform those obtained in previous studies on ShanghaiTech and UCF-Crime, which demonstrates the robustness of our approach. All relevant code has been made available at https://github.com/hychen96/STGA-VAD. & COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Learning Graph Enhanced Spatial-Temporal Coherence for Video Anomaly Detection
    Cheng, Kai
    Liu, Yang
    Zeng, Xinhua
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 314 - 318
  • [2] MST-GAT: A multimodal spatial-temporal graph attention network for time series anomaly detection
    Ding, Chaoyue
    Sun, Shiliang
    Zhao, Jing
    [J]. INFORMATION FUSION, 2023, 89 : 527 - 536
  • [3] STGAT-MAD : SPATIAL-TEMPORAL GRAPH ATTENTION NETWORK FOR MULTIVARIATE TIME SERIES ANOMALY DETECTION
    Zhan, Jun
    Wang, Siqi
    Ma, Xiandong
    Wu, Chengkun
    Yang, Canqun
    Zeng, Detian
    Wang, Shilin
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3568 - 3572
  • [4] Graph Neural Network for Fraud Detection via Spatial-Temporal Attention
    Cheng, Dawei
    Wang, Xiaoyang
    Zhang, Ying
    Zhang, Liqing
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3800 - 3813
  • [5] Interpretable Spatial-Temporal Graph Convolutional Network for System Log Anomaly Detection
    Xu, Rucong
    Li, Yun
    [J]. ADVANCED ENGINEERING INFORMATICS, 2024, 62
  • [6] Spatial-Temporal Attention Network for Temporal Knowledge Graph Completion
    Zhang, Jiasheng
    Liang, Shuang
    Deng, Zhiyi
    Shao, Jie
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT I, 2021, 12681 : 207 - 223
  • [7] Memory-Augmented Spatial-Temporal Consistency Network for Video Anomaly Detection
    Li, Zhangxun
    Zhao, Mengyang
    Zeng, Xinhua
    Wang, Tian
    Pang, Chengxin
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 95 - 107
  • [8] Exploiting Spatial-temporal Correlations for Video Anomaly Detection
    Zhao, Mengyang
    Liu, Yang
    Liu, Jing
    Zeng, Xinhua
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1727 - 1733
  • [9] LEARNING TASK-SPECIFIC REPRESENTATION FOR VIDEO ANOMALY DETECTION WITH SPATIAL-TEMPORAL ATTENTION
    Liu, Yang
    Liu, Jing
    Zhu, Xiaoguang
    Wei, Donglai
    Huang, Xiaohong
    Song, Liang
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2190 - 2194
  • [10] Spatial-Temporal Graph Network for Video Crowd Counting
    Wu, Zhe
    Zhang, Xinfeng
    Tian, Geng
    Wang, Yaowei
    Huang, Qingming
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (01) : 228 - 241