Multi-scale Spatial and Temporal Feature Aggregation Graph Convolutional Network for Skeleton-Based Action Recognition

被引:0
|
作者
Du, Yifei [1 ]
Zhang, Mingliang [1 ]
Li, Bin [1 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Sch Math & Stat, Jinan 250353, Shandong, Peoples R China
关键词
Graph Convolutional Network; Skeleton Action Recognition; Multi-scale Feature Aggregation;
D O I
10.1007/978-981-97-8511-7_36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of deep learning, skeleton data is widely used for action recognition. Currently, the recognition of human skeleton action based on Graph Convolutional Networks (GCNs) has occupied the main position and has achieved remarkable results. However, existing methods are not sufficiently expressive concerning temporal and spatial features. Therefore, we propose a Multi-scale Spatial and Temporal Feature Aggregation Graph Convolutional Network (MSTA-GCN) for skeleton-based action recognition, which can effectively aggregate features from spatial and temporal dimensions using a hierarchical structure. Specifically, we integrate the topology learning strategy with the edge convolution module to aggregate global and fine-grained features at the spatial dimension. On this basis, a multi-scale temporal convolution based on a temporal attention module is proposed to aggregate the node features that change within frames under the condition of guaranteeing the global temporal features. Finally, the feature refinement module of skeleton data is improved to enhance the ability of the network to represent spatial features. Our proposed MSTA-GCN outperforms most mainstream methods and achieves satisfactory performance on three large-scale datasets: NTU RGB+D 60, NTU RGB+D 120, and Northwestern-UCLA.
引用
收藏
页码:511 / 524
页数:14
相关论文
共 50 条
  • [1] Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
    Chen, Zhan
    Li, Sicheng
    Yang, Bing
    Li, Qinghan
    LiU, Hong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1113 - 1122
  • [2] Multi-scale spatial–temporal convolutional neural network for skeleton-based action recognition
    Qin Cheng
    Jun Cheng
    Ziliang Ren
    Qieshi Zhang
    Jianming Liu
    Pattern Analysis and Applications, 2023, 26 (3) : 1303 - 1315
  • [3] Multi-Scale Spatial Temporal Graph Neural Network for Skeleton-Based Action Recognition
    Feng, Dong
    Wu, ZhongCheng
    Zhang, Jun
    Ren, TingTing
    IEEE ACCESS, 2021, 9 : 58256 - 58265
  • [4] Multi-scale skeleton simplification graph convolutional network for skeleton-based action recognition
    Fan, Zhang
    Ding, Chongyang
    Kai, Liu
    Liu, Hongjin
    IET COMPUTER VISION, 2024, 18 (07) : 992 - 1003
  • [5] Multiple temporal scale aggregation graph convolutional network for skeleton-based action recognition
    Li, Xuanfeng
    Lu, Jian
    Zhou, Jian
    Liu, Wei
    Zhang, Kaibing
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [6] Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition
    Jang, Sungjun
    Lee, Heansung
    Kim, Woo Jin
    Lee, Jungho
    Woo, Sungmin
    Lee, Sangyoun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7244 - 7258
  • [7] Multi-scale spatial-temporal convolutional neural network for skeleton-based action recognition
    Cheng, Qin
    Cheng, Jun
    Ren, Ziliang
    Zhang, Qieshi
    Liu, Jianming
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 1303 - 1315
  • [8] Multi-temporal scale aggregation refinement graph convolutional network for skeleton-based action recognition
    Li, Xuanfeng
    Lu, Jian
    Zhou, Jian
    Liu, Wei
    Zhang, Kaibing
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (01)
  • [9] Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition
    Shu, Yang
    Li, Wanggen
    Li, Doudou
    Gao, Kun
    Jie, Biao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 16 - 28
  • [10] Multi-Scale Adaptive Aggregate Graph Convolutional Network for Skeleton-Based Action Recognition
    Zheng, Zhiyun
    Wang, Yizhou
    Zhang, Xingjin
    Wang, Junfeng
    APPLIED SCIENCES-BASEL, 2022, 12 (03):