Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection

被引:6
|
作者
Chen, Xiaoyu [1 ,2 ]
Kan, Shichao [3 ]
Zhang, Fanghui [1 ,2 ]
Cen, Yigang [1 ,2 ]
Zhang, Linna [4 ]
Zhang, Damin [5 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Key Lab Adv Informat Sci & Network Technol, Beijing 100044, Peoples R China
[3] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
[4] Guizhou Univ, Coll Mech Engn, Guiyang 550025, Guizhou, Peoples R China
[5] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Guizhou, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Multiscale spatial temporal graph; Spatial attention graph convolution; Skeleton-based anomaly behavior detection; NEURAL-NETWORKS;
D O I
10.1016/j.jvcir.2022.103707
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomaly behavior detection plays a significant role in emergencies such as robbery. Although a lot of works have been proposed to deal with this problem, the performance in real applications is still relatively low. Here, to detect abnormal human behavior in videos, we propose a multiscale spatial temporal attention graph convolution network (MSTA-GCN) to capture and cluster the features of the human skeleton. First, based on the human skeleton graph, a multiscale spatial temporal attention graph convolution block (MSTA-GCB) is built which contains multiscale graphs in temporal and spatial dimensions. MSTA-GCB can simulate the motion relations of human body components at different scales where each scale corresponds to different granularity of annotation levels on the human skeleton. Then, static, globally-learned and attention-based adjacency matrices in the graph convolution module are proposed to capture hierarchical representation. Finally, extensive experiments are carried out on the ShanghaiTech Campus and CUHK Avenue datasets, the final results of the frame-level AUC/EER are 0.759/0.311 and 0.876/0.192, respectively. Moreover, the frame-level AUC is 0.768 for the human-related ShanghaiTech subset. These results show that our MSTA-GCN outperforms most of methods in video anomaly detection and we have obtained a new state-of-the-art performance in skeleton-based anomaly behavior detection.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] On the spatial attention in spatio-temporal graph convolutional networks for skeleton-based human action recognition
    Heidari, Negar
    Iosifidis, Alexandros
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [42] Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
    Chen, Zhan
    Li, Sicheng
    Yang, Bing
    Li, Qinghan
    LiU, Hong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1113 - 1122
  • [43] Attention module-based spatial-temporal graph convolutional networks for skeleton-based action recognition
    Kong, Yinghui
    Li, Li
    Zhang, Ke
    Ni, Qiang
    Han, Jungong
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (04)
  • [44] Multi-Scale Spatial Temporal Graph Neural Network for Skeleton-Based Action Recognition
    Feng, Dong
    Wu, ZhongCheng
    Zhang, Jun
    Ren, TingTing
    IEEE ACCESS, 2021, 9 : 58256 - 58265
  • [45] Self-Relational Graph Convolution Network for Skeleton-Based Action Recognition
    Yussif, Sophyani Banaamwini
    Xie, Ning
    Yang, Yang
    Shen, Heng Tao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 27 - 36
  • [46] Mixed graph convolution and residual transformation network for skeleton-based action recognition
    Shuhua Liu
    Xiaoying Bai
    Ming Fang
    Lanting Li
    Chih-Cheng Hung
    Applied Intelligence, 2022, 52 : 1544 - 1555
  • [47] Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition
    Yan, Sijie
    Xiong, Yuanjun
    Lin, Dahua
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7444 - 7452
  • [48] An Efficient Graph Convolution Network for Skeleton-Based Dynamic Hand Gesture Recognition
    Peng, Sheng-Hui
    Tsai, Pei-Hsuan
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (04) : 2179 - 2189
  • [49] Mixed graph convolution and residual transformation network for skeleton-based action recognition
    Liu, Shuhua
    Bai, Xiaoying
    Fang, Ming
    Li, Lanting
    Hung, Chih-Cheng
    APPLIED INTELLIGENCE, 2022, 52 (02) : 1544 - 1555
  • [50] Skeleton-based attention-aware spatial-temporal model for action detection and recognition
    Cui, Ran
    Zhu, Aichun
    Wu, Jingran
    Hua, Gang
    IET COMPUTER VISION, 2020, 14 (05) : 177 - 184