Feedback Graph Convolutional Network for Skeleton-Based Action Recognition

被引:71
|
作者
Yang, Hao [1 ]
Yan, Dan [1 ]
Zhang, Li [2 ]
Sun, Yunda [1 ]
Li, Dong [2 ]
Maybank, Stephen J. [3 ]
机构
[1] Nuctech Co Ltd, R&D Ctr Artificial Intelligent, Beijing 100190, Peoples R China
[2] Tsinghua Univ, Dept Engn Phys, Beijing 100084, Peoples R China
[3] Univ London, Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HU, England
关键词
Skeleton; Feature extraction; Joints; Semantics; Predictive models; Data models; Convolution; Feedback mechanism; graph convolutional network; skeleton; action recognition;
D O I
10.1109/TIP.2021.3129117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skeleton-based action recognition has attracted considerable attention since the skeleton data is more robust to the dynamic circumstances and complicated backgrounds than other modalities. Recently, many researchers have used the Graph Convolutional Network (GCN) to model spatial-temporal features of skeleton sequences by an end-to-end optimization. However, conventional GCNs are feedforward networks for which it is impossible for the shallower layers to access semantic information in the high-level layers. In this paper, we propose a novel network, named Feedback Graph Convolutional Network (FGCN). This is the first work that introduces a feedback mechanism into GCNs for action recognition. Compared with conventional GCNs, FGCN has the following advantages: (1) A multi-stage temporal sampling strategy is designed to extract spatial-temporal features for action recognition in a coarse to fine process; (2) A Feedback Graph Convolutional Block (FGCB) is proposed to introduce dense feedback connections into the GCNs. It transmits the high-level semantic features to the shallower layers and conveys temporal information stage by stage to model video level spatial-temporal features for action recognition; (3) The FGCN model provides predictions on-the-fly. In the early stages, its predictions are relatively coarse. These coarse predictions are treated as priors to guide the feature learning in later stages, to obtain more accurate predictions. Extensive experiments on three datasets, NTU-RGB+D, NTU-RGB+D120 and Northwestern-UCLA, demonstrate that the proposed FGCN is effective for action recognition. It achieves the state-of-the-art performance on all three datasets.
引用
收藏
页码:164 / 175
页数:12
相关论文
共 50 条
  • [41] Spatial-temporal slowfast graph convolutional network for skeleton-based action recognition
    Fang, Zheng
    Zhang, Xiongwei
    Cao, Tieyong
    Zheng, Yunfei
    Sun, Meng
    [J]. IET COMPUTER VISION, 2022, 16 (03) : 205 - 217
  • [42] Multi-scale Structural Graph Convolutional Network for Skeleton-based Action Recognition
    Jang S.
    Lee H.
    Kim W.J.
    Lee J.
    Woo S.
    Lee S.
    [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (08) : 1 - 1
  • [43] An improved spatial temporal graph convolutional network for robust skeleton-based action recognition
    Xing, Yuling
    Zhu, Jia
    Li, Yu
    Huang, Jin
    Song, Jinlong
    [J]. APPLIED INTELLIGENCE, 2023, 53 (04) : 4592 - 4608
  • [44] A tri-attention enhanced graph convolutional network for skeleton-based action recognition
    Li, Xingming
    Zhai, Wei
    Cao, Yang
    [J]. IET COMPUTER VISION, 2021, 15 (02) : 110 - 121
  • [45] Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
    Li, Fanjia
    Li, Juanjuan
    Zhu, Aichun
    Xu, Yonggang
    Yin, Hongsheng
    Hua, Gang
    [J]. SENSORS, 2020, 20 (18) : 1 - 19
  • [46] Spatial-Temporal Adaptive Graph Convolutional Network for Skeleton-Based Action Recognition
    Hang, Rui
    Li, MinXian
    [J]. COMPUTER VISION - ACCV 2022, PT IV, 2023, 13844 : 172 - 188
  • [47] Graph convolutional network with STC attention and adaptive normalization for skeleton-based action recognition
    Zhou, Haiyun
    Xiang, Xuezhi
    Qiu, Yujian
    Liu, Xuzhao
    [J]. IMAGING SCIENCE JOURNAL, 2023, 71 (07): : 636 - 646
  • [48] Scene context-aware graph convolutional network for skeleton-based action recognition
    Zhang, Wenxian
    [J]. IET COMPUTER VISION, 2024, 18 (03) : 343 - 354
  • [49] An improved spatial temporal graph convolutional network for robust skeleton-based action recognition
    Yuling Xing
    Jia Zhu
    Yu Li
    Jin Huang
    Jinlong Song
    [J]. Applied Intelligence, 2023, 53 : 4592 - 4608
  • [50] Attention-Based Generative Graph Convolutional Network for Skeleton-Based Human Action Recognition
    Yang, Kai
    Ding, Xiaolu
    Chen, Wai
    [J]. ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 1 - 6