Combining channel-wise joint attention and temporal attention in graph convolutional networks for skeleton-based action recognition

Cited by: 1
Authors
Sun, Zhonghua [1 ,2 ,3 ]
Wang, Tianyi [1 ]
Dai, Meng [1 ]
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Lab Adv Informat Networks, Beijing 100124, Peoples R China
[3] Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
Keywords
Skeleton-based action recognition; Graph convolutional network; Channel-wise joints attention; Temporal attention;
DOI
10.1007/s11760-022-02465-z
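Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Subject classification codes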
0808 ; 0809 ;
Abstract
Graph convolutional networks (GCNs) have proven effective for skeleton-based action recognition because graph topology naturally represents the connectivity of the human body. Nevertheless, modeling human joints effectively in both space and time remains challenging, and attention mechanisms for critical temporal frames and important skeletal joints are still lacking. In this work, we propose a novel GCN that combines channel-wise joint attention and temporal attention for skeleton-based action recognition. Our temporal attention module captures long-term temporal dependencies and then enhances the temporal semantics of key frames. In addition, we design a channel-wise attention module that fuses multi-channel joint weights with the topological map to capture node attention for different actions along the channel dimension. We also propose concatenating joint and bone data along the channel dimension to form the joint & bone (J & B) modality, which can extract hybrid action patterns in combination with channel-wise joint attention. We demonstrate the strong spatio-temporal modeling capability of our model on three widely used datasets: NTU RGB+D, NTU RGB+D 120 and Northwestern-UCLA. Compared with leading GCN-based methods, our model achieves performance comparable to the state of the art.
Pages: 2481 - 2488
Number of pages: 8
Related papers
50 in total
  • [21] Channel attention and multi-scale graph neural networks for skeleton-based action recognition
    Dang, Ronghao
    Liu, Chengju
    Liu, Ming
    Chen, Qijun
    AI COMMUNICATIONS, 2022, 35 (03) : 187 - 205
  • [22] Temporal Attention-Augmented Graph Convolutional Network for Efficient Skeleton-Based Human Action Recognition
    Heidari, Negar
    Iosifidis, Alexandros
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7907 - 7914
  • [23] A graph convolutional neural network model with Fisher vector encoding and channel-wise spatial-temporal aggregation for skeleton-based action recognition
    Tang, Jun
    Wang, Yanjiang
    Fu, Sichao
    Liu, Baodi
    Liu, Weifeng
    IET IMAGE PROCESSING, 2022, 16 (05) : 1433 - 1443
  • [24] Spatial–Temporal gated graph attention network for skeleton-based action recognition
    Mrugendrasinh Rahevar
    Amit Ganatra
    Pattern Analysis and Applications, 2023, 26 (3) : 929 - 939
  • [25] Graph transformer network with temporal kernel attention for skeleton-based action recognition
    Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University, Kunming
    650504, China
    Knowl Based Syst,
  • [26] Graph transformer network with temporal kernel attention for skeleton-based action recognition
    Liu, Yanan
    Zhang, Hao
    Xu, Dan
    He, Kangjian
    KNOWLEDGE-BASED SYSTEMS, 2022, 240
  • [27] Spatial-temporal channel-wise attention network for action recognition
    Lin Chen
    Yungang Liu
    Yongchao Man
    Multimedia Tools and Applications, 2021, 80 : 21789 - 21808
  • [28] Spatial-temporal channel-wise attention network for action recognition
    Chen, Lin
    Liu, Yungang
    Man, Yongchao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (14) : 21789 - 21808
  • [29] View transform graph attention recurrent networks for skeleton-based action recognition
    Huang, Qingqing
    Zhou, Fengyu
    Qin, Runze
    Zhao, Yang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (03) : 599 - 606
  • [30] View transform graph attention recurrent networks for skeleton-based action recognition
    Qingqing Huang
    Fengyu Zhou
    Runze Qin
    Yang Zhao
    Signal, Image and Video Processing, 2021, 15 : 599 - 606