Adaptive spatiotemporal graph convolutional network with intermediate aggregation of multi-stream skeleton features for action recognition

被引:8
|
作者
Zhao, Yukai [1 ]
Wang, Jingwei [1 ]
Wang, Han [1 ]
Liu, Min [1 ]
Ma, Yunlong [1 ]
机构
[1] Tongji Univ, Sch Elect & Informat Engn, Shanghai 201804, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Skeleton; Spatiotemporal graph convolutional; network; Multi -stream model; ENSEMBLE;
D O I
10.1016/j.neucom.2022.07.046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video-based action recognition is a challenging problem due to the rapid and uncertain changes in human actions. Recent studies show that incorporating video and human body skeleton helps improve action recognition performance. These methods generally use graph convolutional networks (GCNs) to extract structural features of the human body joints from skeleton data. Yet, most GCN-based methods have some limitations in skeleton-based action recognition. (1) The graph structure of the human body joints is time-invariant, making it difficult to represent the changing relationship between joints across actions. (2) Methods relying on single-stream models only utilize limited information of skeleton data, such as joints or bones, and fail to consider coherent features of movements. (3) Methods relying on multi-stream models have considerable parameters and are inefficient for real-life applications. To address these problems, we propose an adaptive spatiotemporal graph convolutional network with inter-mediate aggregation of multi-stream skeleton features for action recognition. First, our method learns an adaptive graph structure representing the changing relationship between joints. Secondly, we facilitate a multi-stream model to extract various features from the skeleton, including joint-stream, bone-stream, and motion-stream. Moreover, an intermediate aggregation strategy is employed to aggregate these fea-tures and to reduce the parameters of this model. The proposed method has been validated on various benchmarks and a real-world abnormal action dataset. Extensive experimental results show that our method achieves excellent performance in skeleton-based action recognition. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:116 / 124
页数:9
相关论文
共 50 条
  • [1] Skeleton-Based Action Recognition With Multi-Stream Adaptive Graph Convolutional Networks
    Shi, Lei
    Zhang, Yifan
    Cheng, Jian
    Lu, Hanqing
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9532 - 9545
  • [2] Skeleton Action Recognition Based on Multi-Stream Spatial Attention Graph Convolutional SRU Network
    Zhao, Jun-Nan
    She, Qing-Shan
    Meng, Ming
    Chen, Yun
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (07): : 1579 - 1585
  • [3] Multi-stream ternary enhanced graph convolutional network for skeleton-based action recognition
    Jun Kong
    Shengquan Wang
    Min Jiang
    TianShan Liu
    [J]. Neural Computing and Applications, 2023, 35 : 18487 - 18504
  • [4] Multi-stream ternary enhanced graph convolutional network for skeleton-based action recognition
    Kong, Jun
    Wang, Shengquan
    Jiang, Min
    Liu, TianShan
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (25): : 18487 - 18504
  • [5] Multi-stream adaptive spatial-temporal attention graph convolutional network for skeleton-based action recognition
    Yu, Lubin
    Tian, Lianfang
    Du, Qiliang
    Bhutto, Jameel Ahmed
    [J]. IET COMPUTER VISION, 2022, 16 (02) : 143 - 158
  • [6] Multi-stream mixed graph convolutional networks for skeleton-based action recognition
    Zhuang, Boyuan
    Kong, Jun
    Jiang, Min
    Liu, Tianshan
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (06)
  • [7] Multi-stream slowFast graph convolutional networks for skeleton-based action recognition
    Sun, Ning
    Leng, Ling
    Liu, Jixin
    Han, Guang
    [J]. IMAGE AND VISION COMPUTING, 2021, 109
  • [8] Multi-stream P&U adaptive graph convolutional networks for skeleton-based action recognition
    Chen, Minglong
    Liang, Jiuzhen
    Liu, Hao
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (08): : 11614 - 11639
  • [9] Multi-stream P&U adaptive graph convolutional networks for skeleton-based action recognition
    Minglong Chen
    Jiuzhen Liang
    Hao Liu
    [J]. The Journal of Supercomputing, 2024, 80 : 11614 - 11639
  • [10] Skeleton-Based Action Recognition Using Multi-Scale and Multi-Stream Improved Graph Convolutional Network
    Li, Wang
    Liu, Xu
    Liu, Zheng
    Du, Feixiang
    Zou, Qiang
    [J]. IEEE ACCESS, 2020, 8 : 144529 - 144542