Spatial-temporal slowfast graph convolutional network for skeleton-based action recognition

Cited by: 9
Authors
Fang, Zheng [1]
Zhang, Xiongwei [1]
Cao, Tieyong [1, 2]
Zheng, Yunfei [1]
Sun, Meng [1]
Affiliations
[1] Peoples Liberat Army Engn Univ, Inst Command & Control Engn, Nanjing 210001, Jiangsu, Peoples R China
[2] Army Artillery & Def Acad PLA Nanjing, Nanjing, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
computer vision; graph theory; video signal processing; video signals
DOI
10.1049/cvi2.12080
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Graph convolutional networks (GCNs) have achieved great success in skeleton-based action recognition. For GCN-based methods to capture joint relationships, two aspects are crucial: modelling the skeleton data in a suitable spatial-temporal way and designing the adjacency matrix. In this study, we propose the spatial-temporal slowfast graph convolutional network (STSF-GCN) and design the adjacency matrices for the skeleton data graphs in STSF-GCN. STSF-GCN contains two pathways. (1) The fast pathway operates at a high frame rate, and joints of adjacent frames are unified to build 'small' spatial-temporal graphs. A new spatial-temporal adjacency matrix is proposed for these 'small' spatial-temporal graphs, and ablation studies verify its effectiveness. (2) The slow pathway operates at a low frame rate, and joints from all frames are unified to build one 'big' spatial-temporal graph. The adjacency matrix for the 'big' spatial-temporal graph is obtained by computing the self-attention coefficients of each joint. Finally, the outputs of the two pathways are fused to predict the action category. STSF-GCN can efficiently capture both long-range and short-range spatial-temporal joint relationships. On three datasets for skeleton-based action recognition, STSF-GCN achieves state-of-the-art performance at a much lower computational cost.
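The two kinds of adjacency matrix described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact form of the proposed matrices, so the block-structured window adjacency, the scaled dot-product attention, the toy 3-joint chain skeleton, and all function names (`window_adjacency`, `row_normalize`, `attention_adjacency`) are assumptions made for illustration only.

```python
import math
import random


def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def window_adjacency(spatial_adj, window):
    """Adjacency for `window` adjacent frames merged into one 'small' graph.

    Assumption: a joint is linked to itself and to its skeleton neighbours
    in every frame of the window. The paper's proposed matrix may differ.
    """
    n = len(spatial_adj)
    size = n * window
    adj = [[0.0] * size for _ in range(size)]
    for t1 in range(window):
        for t2 in range(window):
            for i in range(n):
                for j in range(n):
                    if i == j or spatial_adj[i][j]:
                        adj[t1 * n + i][t2 * n + j] = 1.0
    return adj


def row_normalize(adj):
    """Scale each row to sum to 1 (rows are non-empty thanks to self-links)."""
    return [[v / sum(row) for v in row] for row in adj]


def attention_adjacency(features, w_q, w_k):
    """Dense adjacency for one 'big' graph over all joints of all frames.

    Assumption: scaled dot-product self-attention, softmax over each row.
    """
    q = matmul(features, w_q)
    k = matmul(features, w_k)
    scale = math.sqrt(len(q[0]))
    adj = []
    for qi in q:
        scores = [sum(a * b for a, b in zip(qi, kj)) / scale for kj in k]
        top = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        adj.append([e / total for e in exps])
    return adj


# Toy skeleton: three joints connected in a chain 0 - 1 - 2 (hypothetical).
chain = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]

# Fast pathway: two adjacent frames form one 6-node 'small' graph.
small = row_normalize(window_adjacency(chain, window=2))

# Slow pathway: 4 subsampled frames x 3 joints = 12 nodes, 2 channels each.
rng = random.Random(0)
features = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(12)]
w_q = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(2)]
w_k = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(2)]
big = attention_adjacency(features, w_q, w_k)
```

In this sketch the 'small' matrix is sparse and fixed by the skeleton, which suits short-range relationships, whereas the 'big' matrix is dense and computed from the features, so any joint in any frame can attend to any other. How the two pathway outputs are fused is not specified in the abstract and is left out here.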
Pages: 205-217
Number of pages: 13
Related papers
50 in total
  • [41] Shuffle Graph Convolutional Network for Skeleton-Based Action Recognition
    Yu, Qiwei
    Dai, Yaping
    Hirota, Kaoru
    Shao, Shuai
    Dai, Wei
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (05) : 790 - 800
  • [42] Feedback Graph Convolutional Network for Skeleton-Based Action Recognition
    Yang, Hao
    Yan, Dan
    Zhang, Li
    Sun, Yunda
    Li, Dong
    Maybank, Stephen J.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 164 - 175
  • [43] Hierarchical Graph Convolutional Network for Skeleton-Based Action Recognition
    Huang, Linjiang
    Huang, Yan
    Ouyang, Wanli
    Wang, Liang
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 93 - 102
  • [44] Dynamic Semantic-Based Spatial-Temporal Graph Convolution Network for Skeleton-Based Human Action Recognition
    Xie, Jianyang
    Meng, Yanda
    Zhao, Yitian
    Nguyen, Anh
    Yang, Xiaoyun
    Zheng, Yalin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6691 - 6704
  • [45] Multiple temporal scale aggregation graph convolutional network for skeleton-based action recognition
    Li, Xuanfeng
    Lu, Jian
    Zhou, Jian
    Liu, Wei
    Zhang, Kaibing
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [46] Fast Temporal Graph Convolutional Model for Skeleton-Based Action Recognition
    Nan, Mihai
    Florea, Adina Magda
    SENSORS, 2022, 22 (19)
  • [47] Multi-scale Spatial and Temporal Feature Aggregation Graph Convolutional Network for Skeleton-Based Action Recognition
    Du, Yifei
    Zhang, Mingliang
    Li, Bin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII, 2025, 15037 : 511 - 524
  • [48] Temporal segment graph convolutional networks for skeleton-based action recognition
    Ding, Chongyang
    Wen, Shan
    Ding, Wenwen
    Liu, Kai
    Belyaev, Evgeny
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 110
  • [49] Hierarchical Spatial-Temporal Network for Skeleton-Based Temporal Action Segmentation
    Tan, Chenwei
    Sun, Tao
    Fu, Talas
    Wang, Yuhan
    Xu, Minjie
    Liu, Shenglan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 28 - 39
  • [50] A motion-aware and temporal-enhanced Spatial-Temporal Graph Convolutional Network for skeleton-based human action segmentation
    Chai, Shurong
    Jain, Rahul Kumar
    Liu, Jiaqing
    Teng, Shiyu
    Tateyama, Tomoko
    Li, Yinhao
    Chen, Yen-Wei
    NEUROCOMPUTING, 2024, 580