Multi-scale skeleton simplification graph convolutional network for skeleton-based action recognition

被引:0
|
作者
Fan, Zhang [1 ]
Ding, Chongyang [1 ]
Kai, Liu [1 ]
Liu, Hongjin [2 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] SunWise Space Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
computer vision; convolution; feature extraction; neural net architecture; neural nets;
D O I
10.1049/cvi2.12300
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human action recognition based on graph convolutional networks (GCNs) is one of the hotspots in computer vision. However, previous methods generally rely on handcrafted graph, which limits the effectiveness of the model in characterising the connections between indirectly connected joints. The limitation leads to weakened connections when joints are separated by long distances. To address the above issue, the authors propose a skeleton simplification method which aims to reduce the number of joints and the distance between joints by merging adjacent joints into simplified joints. Group convolutional block is devised to extract the internal features of the simplified joints. Additionally, the authors enhance the method by introducing multi-scale modelling, which maps inputs into sequences across various levels of simplification. Combining with spatial temporal graph convolution, a multi-scale skeleton simplification GCN for skeleton-based action recognition (M3S-GCN) is proposed for fusing multi-scale skeleton sequences and modelling the connections between joints. Finally, M3S-GCN is evaluated on five benchmarks of NTU RGB+D 60 (C-Sub, C-View), NTU RGB+D 120 (X-Sub, X-Set) and NW-UCLA datasets. Experimental results show that the authors' M3S-GCN achieves state-of-the-art performance with the accuracies of 93.0%, 97.0% and 91.2% on C-Sub, C-View and X-Set benchmarks, which validates the effectiveness of the method. The authors propose a multi-scale skeleton simplification graph convolutional network (M3S-GCN) for skeleton-based action recognition. The model leverages skeleton simplification and multi-scale modelling to effectively capture the intricate connections between the joints, and achieves state-of-the-art performance on three benchmarks, the NTU RGB+D C-Sub, NTU RGB+D C-View and NTU RGB+D 120 X-Set. image
引用
收藏
页码:992 / 1003
页数:12
相关论文
共 50 条
  • [1] Multi-scale Structural Graph Convolutional Network for Skeleton-based Action Recognition
    Jang S.
    Lee H.
    Kim W.J.
    Lee J.
    Woo S.
    Lee S.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (08) : 1 - 1
  • [2] Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition
    Shu, Yang
    Li, Wanggen
    Li, Doudou
    Gao, Kun
    Jie, Biao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 16 - 28
  • [3] Multi-Scale Adaptive Aggregate Graph Convolutional Network for Skeleton-Based Action Recognition
    Zheng, Zhiyun
    Wang, Yizhou
    Zhang, Xingjin
    Wang, Junfeng
    APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [4] Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
    Chen, Zhan
    Li, Sicheng
    Yang, Bing
    Li, Qinghan
    LiU, Hong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1113 - 1122
  • [5] Lighter and faster: A multi-scale adaptive graph convolutional network for skeleton-based action recognition
    Jiang, Yuanjian
    Deng, Hongmin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
  • [6] Multi-Scale Adaptive Graph Convolution Network for Skeleton-Based Action Recognition
    Hu, Huangshui
    Fang, Yue
    Han, Mei
    Qi, Xingshuo
    IEEE ACCESS, 2024, 12 : 16868 - 16880
  • [7] Multi-scale sampling attention graph convolutional networks for skeleton-based action recognition
    Tian, Haoyu
    Zhang, Yipeng
    Wu, Hanbo
    Ma, Xin
    Li, Yibin
    NEUROCOMPUTING, 2024, 597
  • [8] Scale Adaptive Graph Convolutional Network for Skeleton-Based Action Recognition
    Wang X.
    Zhong Y.
    Jin L.
    Xiao Y.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2022, 55 (03): : 306 - 312
  • [9] Skeleton-Based Action Recognition Using Multi-Scale and Multi-Stream Improved Graph Convolutional Network
    Li, Wang
    Liu, Xu
    Liu, Zheng
    Du, Feixiang
    Zou, Qiang
    IEEE ACCESS, 2020, 8 (08): : 144529 - 144542
  • [10] Multi-scale spatial–temporal convolutional neural network for skeleton-based action recognition
    Qin Cheng
    Jun Cheng
    Ziliang Ren
    Qieshi Zhang
    Jianming Liu
    Pattern Analysis and Applications, 2023, 26 (3) : 1303 - 1315