Combining channel-wise joint attention and temporal attention in graph convolutional networks for skeleton-based action recognition

被引:1
|
作者
Sun, Zhonghua [1 ,2 ,3 ]
Wang, Tianyi [1 ]
Dai, Meng [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Lab Adv Informat Networks, Beijing 100124, Peoples R China
[3] Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
关键词
Skeleton-based action recognition; Graph convolutional network; Channel-wise joints attention; Temporal attention;
D O I
10.1007/s11760-022-02465-z
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Graph convolutional networks (GCNs) have been shown to be effective in performing skeleton-based action recognition, as graph topology has advantages in representing the natural connectivity of the human bodies. Nevertheless, it is challenging to effectively model the human joints spatially and temporally, and we are lacking attentional mechanisms for critical temporal frames and important skeletal points. In this work, we propose a novel GCNs combined with channel-wise joints and temporal attention for skeleton-based action recognition. Our temporal attention module captures the long-term dependence of time and then enhances the temporal semantics of key frames. In addition, we design a channel-wise attention module that fuses multi-channel joint weights with the topological map to capture the attention of nodes at different actions along the channel dimension. We propose to concatenate joint and bone together along the channel dimension as the joint & bone (J & B) modality, J & B modality can extract hybrid action patterns under the coalition of channel-wise joint attention. We prove the powerful spatio-temporal modeling capability of our model on three widely used dataset, NTU-RGB D, NTU RGB+D 120 and Northwestern-UCLA. Compared with leading GCN-based methods, we achieve performance comparable to the-state-of-art.
引用
收藏
页码:2481 / 2488
页数:8
相关论文
共 50 条
  • [41] Recurrent graph convolutional networks for skeleton-based action recognition
    Zhu, Guangming
    Yang, Lu
    Zhang, Liang
    Shen, Peiyi
    Song, Juan
    Proceedings - International Conference on Pattern Recognition, 2020, : 1352 - 1359
  • [42] Skeleton-Based Action Recognition with Combined Part-Wise Topology Graph Convolutional Networks
    Zhu, Xiaowei
    Huang, Qian
    Li, Chang
    Cui, Jingwen
    Chen, Yingying
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 43 - 59
  • [43] Recurrent Graph Convolutional Networks for Skeleton-based Action Recognition
    Zhu, Guangming
    Yang, Lu
    Zhang, Liang
    Shen, Peiyi
    Song, Juan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1352 - 1359
  • [44] Prompt-supervised dynamic attention graph convolutional network for skeleton-based action recognition
    Zhu, Shasha
    Sun, Lu
    Ma, Zeyuan
    Li, Chenxi
    He, Dongzhi
    NEUROCOMPUTING, 2025, 611
  • [45] Temporal-Channel Attention and Convolution Fusion for Skeleton-Based Human Action Recognition
    Liang, Chengwu
    Yang, Jie
    Du, Ruolin
    Hu, Wei
    Hou, Ning
    IEEE ACCESS, 2024, 12 : 64937 - 64948
  • [46] Graph convolutional network with structure pooling and joint-wise channel attention for action recognition
    Chen, Yuxin
    Ma, Gaoqun
    Yuan, Chunfeng
    Li, Bing
    Zhang, Hui
    Wang, Fangshi
    Hu, Weiming
    PATTERN RECOGNITION, 2020, 103
  • [47] Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition
    Huang, Zhen
    Shen, Xu
    Tian, Xinmei
    Li, Houqiang
    Huang, Jianqiang
    Hua, Xian-Sheng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2122 - 2130
  • [48] Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition
    Shu, Yang
    Li, Wanggen
    Li, Doudou
    Gao, Kun
    Jie, Biao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 16 - 28
  • [49] Two Stream Multi-Attention Graph Convolutional Network for Skeleton-Based Action Recognition
    Zhou, Huijian
    Tian, Zhiqiang
    Du, Shaoyi
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 112 - 120
  • [50] Multi-stream adaptive spatial-temporal attention graph convolutional network for skeleton-based action recognition
    Yu, Lubin
    Tian, Lianfang
    Du, Qiliang
    Bhutto, Jameel Ahmed
    IET COMPUTER VISION, 2022, 16 (02) : 143 - 158