Spatial-temporal slowfast graph convolutional network for skeleton-based action recognition

被引:9
|
作者
Fang, Zheng [1 ]
Zhang, Xiongwei [1 ]
Cao, Tieyong [1 ,2 ]
Zheng, Yunfei [1 ]
Sun, Meng [1 ]
机构
[1] Peoples Liberat Army Engn Univ, Inst Command & Control Engn, Nanjing 210001, Jiangsu, Peoples R China
[2] Army Artillery & Def Acad PLA Nanjing, Nanjing, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
computer vision; graph theory; video signal processing; video signals;
D O I
10.1049/cvi2.12080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In skeleton-based action recognition, the graph convolutional network (GCN) has achieved great success. Modelling skeleton data in a suitable spatial-temporal way and designing the adjacency matrix are crucial aspects for GCN-based methods to capture joint relationships. In this study, we propose the spatial-temporal slowfast graph convolutional network (STSF-GCN) and design the adjacency matrices for the skeleton data graphs in STSF-GCN. STSF-GCN contains two pathways: (1) the fast pathway is in a high frame rate, and joints of adjacent frames are unified to build 'small' spatial-temporal graphs. A new spatial-temporal adjacency matrix is proposed for these 'small' spatial-temporal graphs. Ablation studies verify the effectiveness of the proposed adjacency matrix. (2) The slow pathway is in a low frame rate, and joints from all frames are unified to build one 'big' spatial-temporal graph. The adjacency matrix for the 'big' spatial-temporal graph is obtained by computing self-attention coefficients of each joint. Finally, outputs from two pathways are fused to predict the action category. STSF-GCN can efficiently capture both long-range and short-range spatial-temporal joint relationships. On three datasets for skeleton-based action recognition, STSF-GCN can achieve state-of-the-art performance with much less computational cost.
引用
收藏
页码:205 / 217
页数:13
相关论文
共 50 条
  • [21] Spatial-temporal graph transformer network for skeleton-based temporal action segmentation
    Xiaoyan Tian
    Ye Jin
    Zhao Zhang
    Peng Liu
    Xianglong Tang
    Multimedia Tools and Applications, 2024, 83 : 44273 - 44297
  • [22] Spatial-temporal graph transformer network for skeleton-based temporal action segmentation
    Tian, Xiaoyan
    Jin, Ye
    Zhang, Zhao
    Liu, Peng
    Tang, Xianglong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (15) : 44273 - 44297
  • [23] Multi-scale spatial-temporal convolutional neural network for skeleton-based action recognition
    Cheng, Qin
    Cheng, Jun
    Ren, Ziliang
    Zhang, Qieshi
    Liu, Jianming
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 1303 - 1315
  • [24] Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
    Chen, Zhan
    Li, Sicheng
    Yang, Bing
    Li, Qinghan
    LiU, Hong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1113 - 1122
  • [25] Multi-stream slowFast graph convolutional networks for skeleton-based action recognition
    Sun, Ning
    Leng, Ling
    Liu, Jixin
    Han, Guang
    IMAGE AND VISION COMPUTING, 2021, 109
  • [26] Spatial-temporal graph neural ODE networks for skeleton-based action recognition
    Pan, Longji
    Lu, Jianguang
    Tang, Xianghong
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [27] A Separable Spatial-Temporal Graph Learning Approach for Skeleton-Based Action Recognition
    Zheng, Hui
    Zhao, Ye-Sheng
    Zhang, Bo
    Shang, Guo-Qiang
    IEEE SENSORS LETTERS, 2024, 8 (11)
  • [28] Spatial-temporal graph neural ODE networks for skeleton-based action recognition
    Longji Pan
    Jianguang Lu
    Xianghong Tang
    Scientific Reports, 14
  • [29] Temporal Receptive Field Graph Convolutional Network for Skeleton-Based Action Recognition
    Zhang, Qingqi
    Wu, Ren
    Nakata, Mitsuru
    Ge, Qi-Wei
    2024 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2024, 2024,
  • [30] Temporal Receptive Field Graph Convolutional Network for Skeleton-based Action Recognition
    Zhang, Qingqi
    Wu, Ren
    Nakata, Mitsuru
    Ge, Qi-Wei
    2024 INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS, AND COMMUNICATIONS, ITC-CSCC 2024, 2024,