Predicting skeleton trajectories using a Skeleton-Transformer for video anomaly detection

Cited by: 0
Authors
Wenfeng Pang
Qianhua He
Yanxiong Li
Affiliation
[1] South China University of Technology, School of Electronic and Information Engineering
Source
Multimedia Systems | 2022 / Vol. 28
Keywords
Anomaly detection; Skeleton trajectory prediction; Skeleton-Transformer; Multi-head self-attention; Temporal convolutional layer;
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Video anomaly detection identifies video content that does not conform to the normal patterns present in the training set. Because appearance-based features are susceptible to background interference, this paper departs from the appearance-based methods used in most prior work and proposes a novel Skeleton-Transformer (SkT) that predicts future pose components in video frames and takes the errors between the predicted pose components and their corresponding expected values as anomaly scores. In SkT, a multi-head self-attention (MSA) module and a temporal convolutional layer (TCL), which are complementary because they process information from different viewpoints, compose a skeleton attention (SkA) block. The MSA module captures long-range dependencies between arbitrary pairs of pose components along the spatial and temporal dimensions from different perspectives, while the TCL concentrates on local temporal information. Multiple SkA blocks are then stacked to form the main constituent of the SkT. To the best of our knowledge, the proposed approach is the first to apply the Transformer framework to anomaly detection based on pose components, and we conduct experiments to determine the optimal structure. The proposed method achieves a frame-level AUC of 77.65% on the HR-ShanghaiTech dataset, exceeding state-of-the-art methods. Moreover, ablation studies validate the effectiveness of each module in the SkT, further indicating that the Transformer-based method is promising for anomaly detection.
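The scoring idea in the abstract (prediction error as anomaly score, evaluated by frame-level AUC) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the error metric or how per-person errors are combined into a frame score, so the mean squared joint distance and the max-over-persons aggregation below are assumptions, and the names `pose_error`, `frame_score`, and `frame_level_auc` are hypothetical.

```python
from typing import Sequence, Tuple

# One skeleton: an (x, y) coordinate per joint.
Pose = Sequence[Tuple[float, float]]


def pose_error(predicted: Pose, observed: Pose) -> float:
    """Mean squared distance between predicted and observed joints.

    Assumption: the abstract only says "errors"; MSE is one common choice.
    """
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("poses must be non-empty with the same joint count")
    total = 0.0
    for (px, py), (ox, oy) in zip(predicted, observed):
        total += (px - ox) ** 2 + (py - oy) ** 2
    return total / len(predicted)


def frame_score(predicted: Sequence[Pose], observed: Sequence[Pose]) -> float:
    """Frame-level anomaly score.

    Assumption: take the largest per-person error in the frame;
    a frame with no detected person scores 0.0.
    """
    errors = [pose_error(p, o) for p, o in zip(predicted, observed)]
    return max(errors, default=0.0)


def frame_level_auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """ROC AUC by pairwise ranking; labels use 1 = anomalous, 0 = normal."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one anomalous and one normal frame")
    # Count anomalous/normal pairs ranked correctly; ties count as half.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))
```

In this sketch a higher score means a frame is more likely anomalous, so an AUC near 1.0 means anomalous frames are consistently ranked above normal ones. The pairwise AUC is quadratic in the number of frames and is meant only for illustration.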
Pages: 1481-1494
Page count: 13
Related papers
50 in total
  • [31] Corner detection using morphological skeleton: An efficient and nonparametric approach
    Dinesh, R
    Guru, DS
    COMPUTER VISION - ACCV 2006, PT II, 2006, 3852 : 752 - 760
  • [32] Human Abnormal Behavior Detection Based on RGBD Video's Skeleton Information Entropy
    Bian, Ziyang
    Xu, Tingfa
    Su, Chang
    Luo, Xuan
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2016, 386 : 715 - 723
  • [33] Transformer Based Spatial-Temporal Extraction Model for Video Anomaly Detection
    Wang, Zhiqiang
    Gu, Xiaojing
    Gu, Xingsheng
    2024 8TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION, ICRCA 2024, 2024, : 370 - 374
  • [34] Power Consumption Predicting and Anomaly Detection Based on Transformer and K-Means
    Zhang, Junfeng
    Zhang, Hui
    Ding, Song
    Zhang, Xiaoxiong
    FRONTIERS IN ENERGY RESEARCH, 2021, 9
  • [35] Power Consumption Predicting and Anomaly Detection Based on Transformer and K-Means
    Zhang, Junfeng
    Zhang, Hui
    Ding, Song
    Zhang, Xiaoxiong
    Ding, Song (dingsong1129@163.com), 1600, Frontiers Media S.A. (09):
  • [36] Shape Matching using Skeleton Context for Automated Bow Echo Detection
    Kamani, Mohammad Mahdi
    Farhat, Farshid
    Wistar, Stephen
    Wang, James Z.
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 901 - 908
  • [37] A Skeleton Analysis Based Fall Detection Method Using ToF Camera
    Kong, Xiangbo
    Kumaki, Takeshi
    Meng, Lin
    Tomiyama, Hiroyuki
    2020 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI2020), 2021, 187 : 252 - 257
  • [38] Hand Detection and Tracking Using the Skeleton of the Blob for Medical Rehabilitation Applications
    Gil-Jimenez, Pedro
    Losilla-Lopez, Beatriz
    Torres-Cueco, Rafael
    Campilho, Aurelio
    Lopez-Sastre, Roberto
    IMAGE ANALYSIS AND RECOGNITION, PT II, 2012, 7325 : 130 - 137
  • [39] Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection
    Chen, Xiaoyu
    Kan, Shichao
    Zhang, Fanghui
    Cen, Yigang
    Zhang, Linna
    Zhang, Damin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90
  • [40] Frame-Wise Action Recognition Training Framework for Skeleton-Based Anomaly Behavior Detection
    Tani, Hiroaki
    Shibata, Tomoyuki
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 312 - 323