Two-Stream Temporal Convolutional Networks for Skeleton-Based Human Action Recognition

被引:0
|
作者
Jin-Gong Jia
Yuan-Feng Zhou
Xing-Wei Hao
Feng Li
Christian Desrosiers
Cai-Ming Zhang
机构
[1] Shandong University,School of Software
[2] University of Quebec,Department of Software and IT Engineering
关键词
skeleton; action recognition; temporal convolutional network (TCN); vector feature representation; neural network;
D O I
暂无
中图分类号
学科分类号
摘要
With the growing popularity of somatosensory interaction devices, human action recognition is becoming attractive in many application scenarios. Skeleton-based action recognition is effective because the skeleton can represent the position and the structure of key points of the human body. In this paper, we leverage spatiotemporal vectors between skeleton sequences as input feature representation of the network, which is more sensitive to changes of the human skeleton compared with representations based on distance and angle features. In addition, we redesign residual blocks that have different strides in the depth of the network to improve the processing ability of the temporal convolutional networks (TCNs) for long time dependent actions. In this work, we propose the two-stream temporal convolutional networks (TS-TCNs) that take full advantage of the inter-frame vector feature and the intra-frame vector feature of skeleton sequences in the spatiotemporal representations. The framework can integrate different feature representations of skeleton sequences so that the two feature representations can make up for each other’s shortcomings. The fusion loss function is used to supervise the training parameters of the two branch networks. Experiments on public datasets show that our network achieves superior performance and attains an improvement of 1.2% over the recent GCN-based (BGC-LSTM) method on the NTU RGB+D dataset.
引用
收藏
页码:538 / 550
页数:12
相关论文
共 50 条
  • [31] Human Action Recognition Based on a Two-stream Convolutional Network Classifier
    Silva, Vincius de Oliveira
    Vidal, Flavio de Barros
    Soares Romariz, Alexandre Ricardo
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 774 - 778
  • [32] On the spatial attention in spatio-temporal graph convolutional networks for skeleton-based human action recognition
    Heidari, Negar
    Iosifidis, Alexandros
    Proceedings of the International Joint Conference on Neural Networks, 2021, 2021-July
  • [33] On the spatial attention in spatio-temporal graph convolutional networks for skeleton-based human action recognition
    Heidari, Negar
    Iosifidis, Alexandros
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [34] Skeleton-based emotion recognition based on two-stream self-attention enhanced spatial-temporal graph convolutional network
    Shi, Jiaqi
    Liu, Chaoran
    Ishi, Carlos Toshinori
    Ishiguro, Hiroshi
    Sensors (Switzerland), 2021, 21 (01): : 1 - 16
  • [35] Skeleton-Based Emotion Recognition Based on Two-Stream Self-Attention Enhanced Spatial-Temporal Graph Convolutional Network
    Shi, Jiaqi
    Liu, Chaoran
    Ishi, Carlos Toshinori
    Ishiguro, Hiroshi
    SENSORS, 2021, 21 (01) : 1 - 16
  • [36] Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition
    Nan, Mihai
    Trascau, Mihai
    Florea, Adina Magda
    Iacob, Cezar Catalin
    SENSORS, 2021, 21 (06) : 1 - 19
  • [37] Distinct Two-Stream Convolutional Networks for Human Action Recognition in Videos Using Segment-Based Temporal Modeling
    Sarabu, Ashok
    Santra, Ajit Kumar
    DATA, 2020, 5 (04) : 1 - 12
  • [38] A comparative review of graph convolutional networks for human skeleton-based action recognition
    Liqi Feng
    Yaqin Zhao
    Wenxuan Zhao
    Jiaxi Tang
    Artificial Intelligence Review, 2022, 55 : 4275 - 4305
  • [39] A comparative review of graph convolutional networks for human skeleton-based action recognition
    Feng, Liqi
    Zhao, Yaqin
    Zhao, Wenxuan
    Tang, Jiaxi
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (05) : 4275 - 4305
  • [40] Focus on temporal graph convolutional networks with unified attention for skeleton-based action recognition
    Gao, Bing-Kun
    Dong, Le
    Bi, Hong-Bo
    Bi, Yun-Ze
    APPLIED INTELLIGENCE, 2022, 52 (05) : 5608 - 5616