Multi-stream slowFast graph convolutional networks for skeleton-based action recognition

被引:22
|
作者
Sun, Ning [1 ]
Leng, Ling [2 ]
Liu, Jixin [1 ]
Han, Guang [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Engn Res Ctr Wideband Wireless Commun Technol, Minist Educ, Nanjing 210003, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Commun & Informat Engn, Nanjing 210003, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Graph convolutional network; Human skeleton; SlowFast network; Attention;
D O I
10.1016/j.imavis.2021.104141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, many efforts have been made to model spatial-temporal features from human skeleton for action recognition by using graph convolutional networks (GCN). Skeleton sequence can precisely represent human pose with a small number of joints while there is still a lot of redundancies across the skeleton sequence in the term of temporal dependency. In order to improve the effectiveness of spatial-temporal feature extraction from skeleton sequence, a SlowFast graph convolution network (SF-GCN) is proposed by implementing the architecture of SlowFast network, which is consisted of the Fast and Slow pathway, in the GCN model. The Fast pathway is a temporal attention embedded lightweight GCN for extracting the feature of fast temporal changes from the skeleton sequence with a high frame rate and fast refreshing speed. The Slow pathway is a spatial attention embedded GCN for extracting the feature of slow temporal changes from the skeleton sequence with a low frame rate and slow refreshing speed. The features of two pathways are fused by using lateral connection and weighted by using channel attention. Based on the aforementioned design, SF-GCN can achieve superior ability of feature extraction while the computational cost significantly drops. In addition to the coordinate information of joints, five high order sequences including edge, the spatial difference and temporal difference of joints and edges are induced to enhance the representation of human action. Six SF-GCNs are implemented for extracting spatial- temporal feature from six kinds of sequences and fused for skeleton-based action recognition, which is called multi-stream SlowFast graph convolutional networks (MSSF-GCN). Extensive experiments are conducted to evaluate the proposed method on three skeleton-based action recognition databases including NTU RGB + D, NTU RGB + D 120, and Skeleton-Kinetics. The results show that the proposed method is effective for skeleton based action recognition and can achieve the recognition accuracy with an obvious advantage in comparison with the state-of-the-art. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Adaptive Attention Memory Graph Convolutional Networks for Skeleton-Based Action Recognition
    Liu, Di
    Xu, Hui
    Wang, Jianzhong
    Lu, Yinghua
    Kong, Jun
    Qi, Miao
    [J]. SENSORS, 2021, 21 (20)
  • [42] Skeleton-based action recognition by part-aware graph convolutional networks
    Qin, Yang
    Mo, Lingfei
    Li, Chenyang
    Luo, Jiayi
    [J]. VISUAL COMPUTER, 2020, 36 (03): : 621 - 631
  • [43] Actional-Structural Graph Convolutional Networks for Skeleton-based Action Recognition
    Li, Maosen
    Chen, Siheng
    Chen, Xu
    Zhang, Ya
    Wang, Yanfeng
    Tian, Qi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3590 - 3598
  • [44] Skeleton-Based Action Recognition With Focusing-Diffusion Graph Convolutional Networks
    Gao, Jialin
    He, Tong
    Zhou, Xi
    Ge, Shiming
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 2058 - 2062
  • [45] Pose-Guided Graph Convolutional Networks for Skeleton-Based Action Recognition
    Chen, Han
    Jiang, Yifan
    Ko, Hanseok
    [J]. IEEE ACCESS, 2022, 10 : 111725 - 111731
  • [46] A comparative review of graph convolutional networks for human skeleton-based action recognition
    Liqi Feng
    Yaqin Zhao
    Wenxuan Zhao
    Jiaxi Tang
    [J]. Artificial Intelligence Review, 2022, 55 : 4275 - 4305
  • [47] A comparative review of graph convolutional networks for human skeleton-based action recognition
    Feng, Liqi
    Zhao, Yaqin
    Zhao, Wenxuan
    Tang, Jiaxi
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (05) : 4275 - 4305
  • [48] Dual-domain graph convolutional networks for skeleton-based action recognition
    Shuo Chen
    Ke Xu
    Zhongjie Mi
    Xinghao Jiang
    Tanfeng Sun
    [J]. Machine Learning, 2022, 111 : 2381 - 2406
  • [49] Skeleton-based action recognition by part-aware graph convolutional networks
    Yang Qin
    Lingfei Mo
    Chenyang Li
    Jiayi Luo
    [J]. The Visual Computer, 2020, 36 : 621 - 631
  • [50] SPATIOTEMPORAL-SPECTRAL GRAPH CONVOLUTIONAL NETWORKS FOR SKELETON-BASED ACTION RECOGNITION
    Chen, Shuo
    Xu, Ke
    Jiang, Xinghao
    Sun, Tanfeng
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,