Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition

被引:0
|
作者
Lubin Yu
Lianfang Tian
Qiliang Du
Jameel Ahmed Bhutto
机构
[1] South China University of Technology,School of Automation Science and Engineering
[2] The Fifth Electronics Research Institute of Ministry of Industry and Information Technology,School of Computer
[3] Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai),undefined
[4] Sino-Singapore International Joint Research Institute,undefined
[5] Key Laboratory of Autonomous Systems and Network Control of Ministry of Education,undefined
[6] Huanggang Normal University,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Graph convolution; Convolutional Neural Network; Adaptive; Attention module; Action recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Action recognition methods based on spatial-temporal skeleton graphs have been applied extensively. The spatial and temporal graphs are generally modeled individually in previous approaches. Recently, many researchers capture the correlation information of temporal and spatial dimensions in spatial-temporal graphs. However, the existing methods have several issues such as 1. The existing modal graphs are defined based on the human body structure which is not flexible enough; 2. The approach to extracting non-local neighborhood features is insufficiently powerful; 3. Attention modules are limited to a single scale; 4. The fusion of multiple data streams is not sufficiently effective. This work proposes a novel multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition that improves the aforementioned issues. The method utilizes an adaptive topology graph with an adaptive connection coefficient to adaptively optimize the topology of the graph during the training process according to the input data. An optimal high-order adjacency matrix is constructed in our work to balance the weight bias, which captures non-local neighborhood features precisely. Moreover, we design a multi-scale attention mechanism to aggregate information from multiple ranges, which makes the graph convolution focus on more efficient nodes, frames, and channels. To further improve the performance of the model, a novel multi-stream framework is proposed to aggregate the high-order information of the skeleton. The experiment results on the NTU-RGBD and Kinetics-Skeleton prove that our proposed method reveals better results than existing methods.
引用
收藏
页码:14838 / 14854
页数:16
相关论文
共 50 条
  • [1] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
    Yu, Lubin
    Tian, Lianfang
    Du, Qiliang
    Bhutto, Jameel Ahmed
    [J]. APPLIED INTELLIGENCE, 2023, 53 (12) : 14838 - 14854
  • [2] Skeleton-based multi-stream adaptive-attentional sub-graph convolution network for action recognition
    Liu, Huan
    Wu, Jian
    Ma, Haokai
    Yan, Yuqi
    He, Rui
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 2935 - 2958
  • [3] Skeleton-based multi-stream adaptive-attentional sub-graph convolution network for action recognition
    Huan Liu
    Jian Wu
    Haokai Ma
    Yuqi Yan
    Rui He
    [J]. Multimedia Tools and Applications, 2024, 83 : 2935 - 2958
  • [4] Multi-stream adaptive spatial-temporal attention graph convolutional network for skeleton-based action recognition
    Yu, Lubin
    Tian, Lianfang
    Du, Qiliang
    Bhutto, Jameel Ahmed
    [J]. IET COMPUTER VISION, 2022, 16 (02) : 143 - 158
  • [5] Skeleton-Based Action Recognition With Multi-Stream Adaptive Graph Convolutional Networks
    Shi, Lei
    Zhang, Yifan
    Cheng, Jian
    Lu, Hanqing
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9532 - 9545
  • [6] Fusing angular features for skeleton-based action recognition using multi-stream graph convolution network
    Huang, Qian
    Liu, Wenting
    Shang, Mingzhou
    Wang, Yiming
    [J]. IET IMAGE PROCESSING, 2024, 18 (07) : 1694 - 1709
  • [7] Multi-Stream and Enhanced Spatial-Temporal Graph Convolution Network for Skeleton-Based Action Recognition
    Li, Fanjia
    Zhu, Aichun
    Xu, Yonggang
    Cui, Ran
    Hua, Gang
    [J]. IEEE ACCESS, 2020, 8 : 97757 - 97770
  • [8] Multi-stream ternary enhanced graph convolutional network for skeleton-based action recognition
    Kong, Jun
    Wang, Shengquan
    Jiang, Min
    Liu, TianShan
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (25): : 18487 - 18504
  • [9] Multi-stream ternary enhanced graph convolutional network for skeleton-based action recognition
    Jun Kong
    Shengquan Wang
    Min Jiang
    TianShan Liu
    [J]. Neural Computing and Applications, 2023, 35 : 18487 - 18504
  • [10] Skeleton-based action recognition with multi-stream, multi-scale dilated spatial-temporal graph convolution network
    Zhang, Haiping
    Liu, Xu
    Yu, Dongjin
    Guan, Liming
    Wang, Dongjing
    Ma, Conghao
    Hu, Zepeng
    [J]. APPLIED INTELLIGENCE, 2023, 53 (14) : 17629 - 17643