Multi-scale skeleton simplification graph convolutional network for skeleton-based action recognition

Authors
Fan, Zhang [1]
Ding, Chongyang [1]
Kai, Liu [1]
Liu, Hongjin [2]
Affiliations
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] SunWise Space Technol, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
computer vision; convolution; feature extraction; neural net architecture; neural nets
DOI
10.1049/cvi2.12300
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human action recognition based on graph convolutional networks (GCNs) is one of the hotspots in computer vision. However, previous methods generally rely on a handcrafted graph, which limits the effectiveness of the model in characterising the connections between indirectly connected joints. This limitation weakens the connections between joints that are separated by long distances. To address this issue, the authors propose a skeleton simplification method that reduces both the number of joints and the distance between joints by merging adjacent joints into simplified joints. A group convolutional block is devised to extract the internal features of the simplified joints. Additionally, the authors enhance the method by introducing multi-scale modelling, which maps inputs into sequences at various levels of simplification. Combined with spatial-temporal graph convolution, a multi-scale skeleton simplification GCN for skeleton-based action recognition (M3S-GCN) is proposed for fusing multi-scale skeleton sequences and modelling the connections between joints. Finally, M3S-GCN is evaluated on five benchmarks from the NTU RGB+D 60 (C-Sub, C-View), NTU RGB+D 120 (X-Sub, X-Set) and NW-UCLA datasets. Experimental results show that M3S-GCN achieves state-of-the-art performance, with accuracies of 93.0%, 97.0% and 91.2% on the C-Sub, C-View and X-Set benchmarks respectively, which validates the effectiveness of the method.
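The merging step described in the abstract can be pictured with a small sketch. This is not the authors' implementation: the function names, the mean pooling of features, the rule for connecting simplified joints, and the six-joint chain are all assumptions chosen for illustration. It only shows how merging adjacent joints can reduce both the joint count and the hop distance between far-apart joints.

```python
import numpy as np

def simplify_skeleton(features, adjacency, groups):
    """Merge groups of adjacent joints into simplified joints (illustrative only).

    features:  (J, C) array of per-joint features.
    adjacency: (J, J) symmetric 0/1 array of bone connections.
    groups:    list of lists; groups[k] holds the joints merged into simplified joint k.
    """
    num_joints = adjacency.shape[0]
    # Assignment matrix: assign[j, k] = 1 if joint j belongs to group k.
    assign = np.zeros((num_joints, len(groups)))
    for k, members in enumerate(groups):
        assign[members, k] = 1.0
    # Mean-pool the features inside each group (an assumption, not from the paper).
    pooled = (assign.T @ features) / assign.sum(axis=0)[:, None]
    # Two simplified joints are linked if any of their member joints were linked.
    coarse = (assign.T @ adjacency @ assign) > 0
    np.fill_diagonal(coarse, False)  # drop self-loops created by merged bones
    return pooled, coarse.astype(int)

def hop_distance(adjacency, src, dst):
    """Breadth-first-search hop count between two joints (-1 if unreachable)."""
    frontier, seen, hops = {src}, {src}, 0
    while dst not in frontier:
        frontier = {int(n) for f in frontier
                    for n in np.flatnonzero(adjacency[f])} - seen
        if not frontier:
            return -1
        seen |= frontier
        hops += 1
    return hops

# Hypothetical toy skeleton: six joints connected in a chain 0-1-2-3-4-5.
chain = np.zeros((6, 6), dtype=int)
for i in range(5):
    chain[i, i + 1] = chain[i + 1, i] = 1
feats = np.arange(12, dtype=float).reshape(6, 2)

# Merge neighbouring pairs into three simplified joints.
pooled, coarse = simplify_skeleton(feats, chain, [[0, 1], [2, 3], [4, 5]])
print(hop_distance(chain, 0, 5), hop_distance(coarse, 0, 2))
```

In this toy case the two end joints are five hops apart in the original chain and two hops apart after merging, which is the kind of shortening the abstract refers to. Applying the same function with different groupings would give skeletons at several levels of simplification.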
Pages: 992-1003
Number of pages: 12