Predicting skeleton trajectories using a Skeleton-Transformer for video anomaly detection

被引:0
|
作者
Wenfeng Pang
Qianhua He
Yanxiong Li
机构
[1] South China University of Technology,School of Electronic and Information Engineering
来源
Multimedia Systems | 2022年 / 28卷
关键词
Anomaly detection; Skeleton trajectory prediction; Skeleton-Transformer; Multi-head self-attention; Temporal convolutional layer;
D O I
暂无
中图分类号
学科分类号
摘要
Video anomaly detection detects video contents that do not conform to normal patterns offered by the training set. Because appearance-based features are susceptible to background interference, unlike most papers applying appearance-based methods, this paper proposes a novel Skeleton-Transformer (SkT) to predict future pose components in video frames and take errors between predicted pose components and corresponding expected values as anomaly scores. In SkT, we apply the multi-head self-attention (MSA) module and temporal convolutional layer (TCL), which are complementary because they focus on processing information from different viewpoints, to compose a skeleton attention (SkA) block. The MSA module can capture long-range dependencies between arbitrary pairwise pose components on spatial and temporal dimensions from different perspectives, while the TCL concentrates on local temporal information. Finally, multiple SkA blocks are stacked to form the major constituent of the SkT. To the best of our knowledge, the proposed approach is the first work applying Transformer framework to anomaly detection based on pose components, and we conduct experiments to determine the optimal structure. The proposed method achieves a frame-level AUC of 77.65% on the HR-ShanghaiTech dataset, exceeding state-of-the-art methods. Moreover, ablation studies validate each module’s effectiveness in the SkT, further verifying that the Transformer-based method is promising for anomaly detection.
引用
收藏
页码:1481 / 1494
页数:13
相关论文
共 50 条
  • [1] Predicting skeleton trajectories using a Skeleton-Transformer for video anomaly detection
    Pang, Wenfeng
    He, Qianhua
    Li, Yanxiong
    MULTIMEDIA SYSTEMS, 2022, 28 (04) : 1481 - 1494
  • [2] Video anomaly detection using CycleGan based on skeleton features
    Fan, Zheyi
    Yi, Shuhan
    Wu, Di
    Song, Yu
    Cui, Mengjie
    Liu, Zhiwen
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 85
  • [3] Learning Regularity in Skeleton Trajectories for Anomaly Detection in Videos
    Morais, Romero
    Vuong Le
    Truyen Tran
    Saha, Budhaditya
    Mansour, Moussa
    Venkatesh, Svetha
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11988 - 11996
  • [4] Driver Anomaly Detection Using Skeleton Images
    Fusek, Radovan
    Sojka, Eduard
    Gaura, Jan
    Halman, Jakub
    ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I, 2023, 14361 : 459 - 471
  • [5] Detection of Elderly Falls in Video Streams Using Skeleton Key Points and Transformer Networks
    Reis, Matteo
    Rojas, Yunevda E. Leon
    Gatto, Bernardo B.
    Colonna, Juan G.
    2024 37TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES, SIBGRAPI 2024, 2024, : 253 - 258
  • [6] TransAnomaly: Video Anomaly Detection Using Video Vision Transformer
    Yuan, Hongchun
    Cai, Zhenyu
    Zhou, Hui
    Wang, Yue
    Chen, Xiangzhi
    IEEE ACCESS, 2021, 9 : 123977 - 123986
  • [7] Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection
    Flaborea, Alessandro
    Collorone, Luca
    di Melendugno, Guido Maria D'Amely
    D'Arrigo, Stefano
    Prenkaj, Bardh
    Galasso, Fabio
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10284 - 10295
  • [8] Physical Violence Detection in Video Streaming Using Partitioned Skeleton Analysis
    Narynov, Sergazy
    Zhumanov, Zhandos
    Gumar, Aidana
    Khassanova, Mariyam
    Omarov, Batyrkhan
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 225 - 230
  • [9] Sign Language Video Synthesis using Skeleton Sequence
    Gencoglu, Sinan
    Keles, Hacer Yalim
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [10] ACTION SEGMENTATION ON REPRESENTATIONS OF SKELETON SEQUENCES USING TRANSFORMER NETWORKS
    Haering, Simon
    Memmesheimer, Raphael
    Paulus, Dietrich
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3053 - 3057