Multi-stream slowFast graph convolutional networks for skeleton-based action recognition

Cited by: 22
Authors
Sun, Ning [1]
Leng, Ling [2]
Liu, Jixin [1]
Han, Guang [1]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Engn Res Ctr Wideband Wireless Commun Technol, Minist Educ, Nanjing 210003, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Commun & Informat Engn, Nanjing 210003, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Action recognition; Graph convolutional network; Human skeleton; SlowFast network; Attention;
DOI
10.1016/j.imavis.2021.104141
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recently, many efforts have been made to model spatial-temporal features of the human skeleton for action recognition using graph convolutional networks (GCNs). A skeleton sequence can precisely represent human pose with a small number of joints, but it still contains considerable redundancy in terms of temporal dependency. To improve the effectiveness of spatial-temporal feature extraction from skeleton sequences, a SlowFast graph convolutional network (SF-GCN) is proposed that implements the SlowFast architecture, consisting of a Fast pathway and a Slow pathway, within a GCN model. The Fast pathway is a lightweight GCN with embedded temporal attention that extracts features of fast temporal changes from the skeleton sequence at a high frame rate and fast refreshing speed. The Slow pathway is a GCN with embedded spatial attention that extracts features of slow temporal changes from the skeleton sequence at a low frame rate and slow refreshing speed. The features of the two pathways are fused through lateral connections and weighted by channel attention. With this design, SF-GCN achieves superior feature extraction while significantly reducing computational cost. In addition to the coordinate information of joints, five high-order sequences, namely edges and the spatial and temporal differences of joints and edges, are introduced to enhance the representation of human action. Six SF-GCNs are implemented to extract spatial-temporal features from the six kinds of sequences, and their outputs are fused for skeleton-based action recognition; the resulting model is called multi-stream SlowFast graph convolutional networks (MSSF-GCN). Extensive experiments are conducted to evaluate the proposed method on three skeleton-based action recognition databases: NTU RGB+D, NTU RGB+D 120, and Skeleton-Kinetics. The results show that the proposed method is effective for skeleton-based action recognition and achieves recognition accuracy with a clear advantage over state-of-the-art methods. (c) 2021 Elsevier B.V. All rights reserved.
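The input preparation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the array layout (frames, joints, coordinates), the parent-joint table, the helper names, and the stride of 4 are all assumptions made for illustration. The abstract does not define the spatial-difference sequences, so only the joint, edge, and temporal-difference streams are shown, along with one plausible way to feed a low-frame-rate Slow pathway and a high-frame-rate Fast pathway.

```python
import numpy as np

def edge_stream(joints, parents):
    """Edge (bone) vectors: each joint minus its assumed parent joint."""
    return joints - joints[:, parents, :]

def temporal_difference(seq):
    """Frame-to-frame difference, zero-padded at the last frame to keep length T."""
    diff = np.zeros_like(seq)
    diff[:-1] = seq[1:] - seq[:-1]
    return diff

def slow_fast_split(seq, stride):
    """Slow input keeps every `stride`-th frame; Fast input keeps every frame."""
    return seq[::stride], seq

# Toy skeleton (hypothetical): T=8 frames, V=4 joints in a chain, C=3 coordinates.
rng = np.random.default_rng(0)
joints = rng.normal(size=(8, 4, 3))
parents = np.array([0, 0, 1, 2])  # assumed topology; the root is its own parent

edges = edge_stream(joints, parents)
streams = {
    "joint": joints,
    "edge": edges,
    "joint_temporal_diff": temporal_difference(joints),
    "edge_temporal_diff": temporal_difference(edges),
}

# Each stream would be split the same way before entering its own two-pathway network.
slow, fast = slow_fast_split(streams["joint"], stride=4)
print(slow.shape, fast.shape)
```

In this sketch the Slow input has a quarter of the frames of the Fast input, which mirrors the low-versus-high frame rate contrast in the abstract; the actual sampling rates used in the paper are not stated here.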
Pages: 9