Multi-stream slowFast graph convolutional networks for skeleton-based action recognition

被引:22
|
作者
Sun, Ning [1 ]
Leng, Ling [2 ]
Liu, Jixin [1 ]
Han, Guang [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Engn Res Ctr Wideband Wireless Commun Technol, Minist Educ, Nanjing 210003, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Commun & Informat Engn, Nanjing 210003, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Graph convolutional network; Human skeleton; SlowFast network; Attention;
D O I
10.1016/j.imavis.2021.104141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, many efforts have been made to model spatial-temporal features from human skeleton for action recognition by using graph convolutional networks (GCN). Skeleton sequence can precisely represent human pose with a small number of joints while there is still a lot of redundancies across the skeleton sequence in the term of temporal dependency. In order to improve the effectiveness of spatial-temporal feature extraction from skeleton sequence, a SlowFast graph convolution network (SF-GCN) is proposed by implementing the architecture of SlowFast network, which is consisted of the Fast and Slow pathway, in the GCN model. The Fast pathway is a temporal attention embedded lightweight GCN for extracting the feature of fast temporal changes from the skeleton sequence with a high frame rate and fast refreshing speed. The Slow pathway is a spatial attention embedded GCN for extracting the feature of slow temporal changes from the skeleton sequence with a low frame rate and slow refreshing speed. The features of two pathways are fused by using lateral connection and weighted by using channel attention. Based on the aforementioned design, SF-GCN can achieve superior ability of feature extraction while the computational cost significantly drops. In addition to the coordinate information of joints, five high order sequences including edge, the spatial difference and temporal difference of joints and edges are induced to enhance the representation of human action. Six SF-GCNs are implemented for extracting spatial- temporal feature from six kinds of sequences and fused for skeleton-based action recognition, which is called multi-stream SlowFast graph convolutional networks (MSSF-GCN). Extensive experiments are conducted to evaluate the proposed method on three skeleton-based action recognition databases including NTU RGB + D, NTU RGB + D 120, and Skeleton-Kinetics. The results show that the proposed method is effective for skeleton based action recognition and can achieve the recognition accuracy with an obvious advantage in comparison with the state-of-the-art. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition
    Lee, Jungho
    Lee, Minhyeok
    Lee, Dogyoon
    Lee, Sangyoun
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10410 - 10419
  • [32] Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition
    Lee, Jungho
    Lee, Minhyeok
    Lee, Dogyoon
    Lee, Sangyoun
    [J]. arXiv, 2022,
  • [33] Multi-Stream General and Graph-Based Deep Neural Networks for Skeleton-Based Sign Language Recognition
    Miah, Abu Saleh Musa
    Hasan, Md. Al Mehedi
    Jang, Si-Woong
    Lee, Hyoun-Sup
    Shin, Jungpil
    [J]. ELECTRONICS, 2023, 12 (13)
  • [34] Graph Convolutional Networks Skeleton-based Action Recognition for Continuous Data Stream: A Sliding Window Approach
    Delamare, Mickael
    Laville, Cyril
    Cabani, Adnane
    Chafouk, Houcine
    [J]. VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 427 - 435
  • [35] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
    Yu, Lubin
    Tian, Lianfang
    Du, Qiliang
    Bhutto, Jameel Ahmed
    [J]. APPLIED INTELLIGENCE, 2023, 53 (12) : 14838 - 14854
  • [36] Multi-scale sampling attention graph convolutional networks for skeleton-based action recognition
    Tian, Haoyu
    Zhang, Yipeng
    Wu, Hanbo
    Ma, Xin
    Li, Yibin
    [J]. NEUROCOMPUTING, 2024, 597
  • [37] Skeleton-based multi-stream adaptive-attentional sub-graph convolution network for action recognition
    Liu, Huan
    Wu, Jian
    Ma, Haokai
    Yan, Yuqi
    He, Rui
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 2935 - 2958
  • [38] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
    Lubin Yu
    Lianfang Tian
    Qiliang Du
    Jameel Ahmed Bhutto
    [J]. Applied Intelligence, 2023, 53 : 14838 - 14854
  • [39] Skeleton-based multi-stream adaptive-attentional sub-graph convolution network for action recognition
    Huan Liu
    Jian Wu
    Haokai Ma
    Yuqi Yan
    Rui He
    [J]. Multimedia Tools and Applications, 2024, 83 : 2935 - 2958
  • [40] Dual-domain graph convolutional networks for skeleton-based action recognition
    Chen, Shuo
    Xu, Ke
    Mi, Zhongjie
    Jiang, Xinghao
    Sun, Tanfeng
    [J]. MACHINE LEARNING, 2022, 111 (07) : 2381 - 2406