Multi-stream slowFast graph convolutional networks for skeleton-based action recognition

被引：22

作者：

Sun, Ning ^{[1
]}

Leng, Ling ^{[2
]}

Liu, Jixin ^{[1
]}

Han, Guang ^{[1
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Engn Res Ctr Wideband Wireless Commun Technol, Minist Educ, Nanjing 210003, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Coll Commun & Informat Engn, Nanjing 210003, Peoples R China

来源：

IMAGE AND VISION COMPUTING | 2021年 / 109卷

基金：

中国国家自然科学基金;

关键词：

Action recognition; Graph convolutional network; Human skeleton; SlowFast network; Attention;

D O I：

10.1016/j.imavis.2021.104141

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, many efforts have been made to model spatial-temporal features from human skeleton for action recognition by using graph convolutional networks (GCN). Skeleton sequence can precisely represent human pose with a small number of joints while there is still a lot of redundancies across the skeleton sequence in the term of temporal dependency. In order to improve the effectiveness of spatial-temporal feature extraction from skeleton sequence, a SlowFast graph convolution network (SF-GCN) is proposed by implementing the architecture of SlowFast network, which is consisted of the Fast and Slow pathway, in the GCN model. The Fast pathway is a temporal attention embedded lightweight GCN for extracting the feature of fast temporal changes from the skeleton sequence with a high frame rate and fast refreshing speed. The Slow pathway is a spatial attention embedded GCN for extracting the feature of slow temporal changes from the skeleton sequence with a low frame rate and slow refreshing speed. The features of two pathways are fused by using lateral connection and weighted by using channel attention. Based on the aforementioned design, SF-GCN can achieve superior ability of feature extraction while the computational cost significantly drops. In addition to the coordinate information of joints, five high order sequences including edge, the spatial difference and temporal difference of joints and edges are induced to enhance the representation of human action. Six SF-GCNs are implemented for extracting spatial- temporal feature from six kinds of sequences and fused for skeleton-based action recognition, which is called multi-stream SlowFast graph convolutional networks (MSSF-GCN). Extensive experiments are conducted to evaluate the proposed method on three skeleton-based action recognition databases including NTU RGB + D, NTU RGB + D 120, and Skeleton-Kinetics. The results show that the proposed method is effective for skeleton based action recognition and can achieve the recognition accuracy with an obvious advantage in comparison with the state-of-the-art. (c) 2021 Elsevier B.V. All rights reserved.

引用

页数：9

共 50 条

[31] Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition
Lee, Jungho
Lee, Minhyeok
Lee, Dogyoon
Lee, Sangyoun
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10410 - 10419
[32] Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition
Lee, Jungho
Lee, Minhyeok
Lee, Dogyoon
Lee, Sangyoun
[J]. arXiv, 2022,
[33] Multi-Stream General and Graph-Based Deep Neural Networks for Skeleton-Based Sign Language Recognition
Miah, Abu Saleh Musa
Hasan, Md. Al Mehedi
Jang, Si-Woong
Lee, Hyoun-Sup
Shin, Jungpil
[J]. ELECTRONICS, 2023, 12 (13)
[34] Graph Convolutional Networks Skeleton-based Action Recognition for Continuous Data Stream: A Sliding Window Approach
Delamare, Mickael
Laville, Cyril
Cabani, Adnane
Chafouk, Houcine
[J]. VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 427 - 435
[35] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
Yu, Lubin
Tian, Lianfang
Du, Qiliang
Bhutto, Jameel Ahmed
[J]. APPLIED INTELLIGENCE, 2023, 53 (12) : 14838 - 14854
[36] Multi-scale sampling attention graph convolutional networks for skeleton-based action recognition
Tian, Haoyu
Zhang, Yipeng
Wu, Hanbo
Ma, Xin
Li, Yibin
[J]. NEUROCOMPUTING, 2024, 597
[37] Skeleton-based multi-stream adaptive-attentional sub-graph convolution network for action recognition
Liu, Huan
Wu, Jian
Ma, Haokai
Yan, Yuqi
He, Rui
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 2935 - 2958
[38] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
Lubin Yu
Lianfang Tian
Qiliang Du
Jameel Ahmed Bhutto
[J]. Applied Intelligence, 2023, 53 : 14838 - 14854
[39] Skeleton-based multi-stream adaptive-attentional sub-graph convolution network for action recognition
Huan Liu
Jian Wu
Haokai Ma
Yuqi Yan
Rui He
[J]. Multimedia Tools and Applications, 2024, 83 : 2935 - 2958
[40] Dual-domain graph convolutional networks for skeleton-based action recognition
Chen, Shuo
Xu, Ke
Mi, Zhongjie
Jiang, Xinghao
Sun, Tanfeng
[J]. MACHINE LEARNING, 2022, 111 (07) : 2381 - 2406

← 1 2 3 4 5 →