Predicting skeleton trajectories using a Skeleton-Transformer for video anomaly detection

被引:0
|
作者
Wenfeng Pang
Qianhua He
Yanxiong Li
机构
[1] South China University of Technology,School of Electronic and Information Engineering
来源
Multimedia Systems | 2022年 / 28卷
关键词
Anomaly detection; Skeleton trajectory prediction; Skeleton-Transformer; Multi-head self-attention; Temporal convolutional layer;
D O I
暂无
中图分类号
学科分类号
摘要
Video anomaly detection detects video contents that do not conform to normal patterns offered by the training set. Because appearance-based features are susceptible to background interference, unlike most papers applying appearance-based methods, this paper proposes a novel Skeleton-Transformer (SkT) to predict future pose components in video frames and take errors between predicted pose components and corresponding expected values as anomaly scores. In SkT, we apply the multi-head self-attention (MSA) module and temporal convolutional layer (TCL), which are complementary because they focus on processing information from different viewpoints, to compose a skeleton attention (SkA) block. The MSA module can capture long-range dependencies between arbitrary pairwise pose components on spatial and temporal dimensions from different perspectives, while the TCL concentrates on local temporal information. Finally, multiple SkA blocks are stacked to form the major constituent of the SkT. To the best of our knowledge, the proposed approach is the first work applying Transformer framework to anomaly detection based on pose components, and we conduct experiments to determine the optimal structure. The proposed method achieves a frame-level AUC of 77.65% on the HR-ShanghaiTech dataset, exceeding state-of-the-art methods. Moreover, ablation studies validate each module’s effectiveness in the SkT, further verifying that the Transformer-based method is promising for anomaly detection.
引用
收藏
页码:1481 / 1494
页数:13
相关论文
共 50 条
  • [21] A fast algorithm for corner detection using the morphologic skeleton
    Liu, WY
    Li, H
    Zhu, GX
    PATTERN RECOGNITION LETTERS, 2001, 22 (08) : 891 - 900
  • [22] Shot Detection Using Skeleton Position in Badminton Videos
    Yoshikawa, Yui
    Shishido, Hidehiko
    Suita, Masashi
    Kameda, Yoshinari
    Kitahara, Itaru
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2021, 2021, 11766
  • [23] Video Anomaly Detection Using Encoder-Decoder Networks with Video Vision Transformer and Channel Attention Blocks
    Kobayashi, Shimpei
    Hizukuri, Akiyoshi
    Nakayama, Ryohei
    2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
  • [24] Skeleton joint trajectories based human activity recognition using deep RNN
    Atiya Usmani
    Nadia Siddiqui
    Saiful Islam
    Multimedia Tools and Applications, 2023, 82 : 46845 - 46869
  • [25] Skeleton joint trajectories based human activity recognition using deep RNN
    Usmani, Atiya
    Siddiqui, Nadia
    Islam, Saiful
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (30) : 46845 - 46869
  • [26] Enhancing Video Anomaly Detection Using a Transformer Spatiotemporal Attention Unsupervised Framework for Large Datasets
    Habeb, Mohamed H.
    Salama, May
    Elrefaei, Lamiaa A.
    ALGORITHMS, 2024, 17 (07)
  • [27] Anomaly Detection in Video using Compression
    Smith, Michael R.
    Gooding, Renee
    Bisila, Jonathan
    Ting, Christina
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL, MIPR 2024, 2024, : 127 - 133
  • [28] TransGANomaly: Transformer based Generative Adversarial Network for Video Anomaly Detection
    Aslam, Nazia
    Kolekar, Maheshkumar H.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100
  • [29] Multi-Contextual Predictions with Vision Transformer for Video Anomaly Detection
    Lee, Jooyeon
    Nam, Woo-Jeoung
    Lee, Seong-Whan
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1012 - 1018
  • [30] Fall Detection and Activity Recognition Using Human Skeleton Features
    Ramirez, Heilym
    Velastin, Sergio A.
    Meza, Ignacio
    Fabregas, Ernesto
    Makris, Dimitrios
    Farias, Gonzalo
    IEEE ACCESS, 2021, 9 (09): : 33532 - 33542