Multi-scale skeleton simplification graph convolutional network for skeleton-based action recognition

被引:0
|
作者
Fan, Zhang [1 ]
Ding, Chongyang [1 ]
Kai, Liu [1 ]
Liu, Hongjin [2 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] SunWise Space Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
computer vision; convolution; feature extraction; neural net architecture; neural nets;
D O I
10.1049/cvi2.12300
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human action recognition based on graph convolutional networks (GCNs) is one of the hotspots in computer vision. However, previous methods generally rely on handcrafted graph, which limits the effectiveness of the model in characterising the connections between indirectly connected joints. The limitation leads to weakened connections when joints are separated by long distances. To address the above issue, the authors propose a skeleton simplification method which aims to reduce the number of joints and the distance between joints by merging adjacent joints into simplified joints. Group convolutional block is devised to extract the internal features of the simplified joints. Additionally, the authors enhance the method by introducing multi-scale modelling, which maps inputs into sequences across various levels of simplification. Combining with spatial temporal graph convolution, a multi-scale skeleton simplification GCN for skeleton-based action recognition (M3S-GCN) is proposed for fusing multi-scale skeleton sequences and modelling the connections between joints. Finally, M3S-GCN is evaluated on five benchmarks of NTU RGB+D 60 (C-Sub, C-View), NTU RGB+D 120 (X-Sub, X-Set) and NW-UCLA datasets. Experimental results show that the authors' M3S-GCN achieves state-of-the-art performance with the accuracies of 93.0%, 97.0% and 91.2% on C-Sub, C-View and X-Set benchmarks, which validates the effectiveness of the method. The authors propose a multi-scale skeleton simplification graph convolutional network (M3S-GCN) for skeleton-based action recognition. The model leverages skeleton simplification and multi-scale modelling to effectively capture the intricate connections between the joints, and achieves state-of-the-art performance on three benchmarks, the NTU RGB+D C-Sub, NTU RGB+D C-View and NTU RGB+D 120 X-Set. image
引用
收藏
页码:992 / 1003
页数:12
相关论文
共 50 条
  • [41] Multi-stream ternary enhanced graph convolutional network for skeleton-based action recognition
    Jun Kong
    Shengquan Wang
    Min Jiang
    TianShan Liu
    Neural Computing and Applications, 2023, 35 : 18487 - 18504
  • [42] A Vertex-Edge Graph Convolutional Network for Skeleton-Based Action Recognition
    Liu, Kai
    Gao, Lei
    Khan, Naimul Mefraz
    Qi, Lin
    Guan, Ling
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [43] Convolutional relation network for skeleton-based action recognition
    Zhu, Jiagang
    Zou, Wei
    Zhu, Zheng
    Hu, Yiming
    NEUROCOMPUTING, 2019, 370 : 109 - 117
  • [44] Temporal Receptive Field Graph Convolutional Network for Skeleton-Based Action Recognition
    Zhang, Qingqi
    Wu, Ren
    Nakata, Mitsuru
    Ge, Qi-Wei
    2024 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2024, 2024,
  • [45] Spatial Graph Convolutional and Temporal Involution Network for Skeleton-based Action Recognition
    Wan, Huifan
    Pan, Guanghui
    Chen, Yu
    Ding, Danni
    Zou, Maoyang
    PROCEEDINGS OF ACM TURING AWARD CELEBRATION CONFERENCE, ACM TURC 2021, 2021, : 204 - 209
  • [46] An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition
    Si, Chenyang
    Chen, Wentao
    Wang, Wei
    Wang, Liang
    Tan, Tieniu
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1227 - 1236
  • [47] Independent Dual Graph Attention Convolutional Network for Skeleton-Based Action Recognition
    Huo, Jinze
    Cai, Haibin
    Meng, Qinggang
    NEUROCOMPUTING, 2024, 583
  • [48] Part-Level Graph Convolutional Network for Skeleton-Based Action Recognition
    Huang, Linjiang
    Huang, Yan
    Ouyang, Wanli
    Wang, Liang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11045 - 11052
  • [49] Richly Activated Graph Convolutional Network for Robust Skeleton-Based Action Recognition
    Song, Yi-Fan
    Zhang, Zhang
    Shan, Caifeng
    Wang, Liang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) : 1915 - 1925
  • [50] Multi-Relational Graph Convolutional Networks for Skeleton-Based Action Recognition
    Liu, Fang
    Dai, Qin
    Wang, Shengze
    Zhao, Liang
    Shi, Xiangbin
    Qiao, Jianzhong
    2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, : 474 - 480