Multi-scale skeleton simplification graph convolutional network for skeleton-based action recognition

被引：0

作者：

Fan, Zhang ^{[1
]}

Ding, Chongyang ^{[1
]}

Kai, Liu ^{[1
]}

Liu, Hongjin ^{[2
]}

机构：

[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China

[2] SunWise Space Technol, Beijing, Peoples R China

来源：

IET COMPUTER VISION | 2024年

基金：

中国国家自然科学基金;

关键词：

computer vision; convolution; feature extraction; neural net architecture; neural nets;

D O I：

10.1049/cvi2.12300

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human action recognition based on graph convolutional networks (GCNs) is one of the hotspots in computer vision. However, previous methods generally rely on handcrafted graph, which limits the effectiveness of the model in characterising the connections between indirectly connected joints. The limitation leads to weakened connections when joints are separated by long distances. To address the above issue, the authors propose a skeleton simplification method which aims to reduce the number of joints and the distance between joints by merging adjacent joints into simplified joints. Group convolutional block is devised to extract the internal features of the simplified joints. Additionally, the authors enhance the method by introducing multi-scale modelling, which maps inputs into sequences across various levels of simplification. Combining with spatial temporal graph convolution, a multi-scale skeleton simplification GCN for skeleton-based action recognition (M3S-GCN) is proposed for fusing multi-scale skeleton sequences and modelling the connections between joints. Finally, M3S-GCN is evaluated on five benchmarks of NTU RGB+D 60 (C-Sub, C-View), NTU RGB+D 120 (X-Sub, X-Set) and NW-UCLA datasets. Experimental results show that the authors' M3S-GCN achieves state-of-the-art performance with the accuracies of 93.0%, 97.0% and 91.2% on C-Sub, C-View and X-Set benchmarks, which validates the effectiveness of the method. The authors propose a multi-scale skeleton simplification graph convolutional network (M3S-GCN) for skeleton-based action recognition. The model leverages skeleton simplification and multi-scale modelling to effectively capture the intricate connections between the joints, and achieves state-of-the-art performance on three benchmarks, the NTU RGB+D C-Sub, NTU RGB+D C-View and NTU RGB+D 120 X-Set. image

引用

页码：992 / 1003

页数：12

共 50 条

[31] Temporal Refinement Graph Convolutional Network for Skeleton-Based Action Recognition
Zhuang T.
Qin Z.
Ding Y.
Deng F.
Chen L.
Qin Z.
Raymond Choo K.-K.
IEEE Transactions on Artificial Intelligence, 2024, 5 (04): : 1586 - 1598
[32] EchoGCN: An Echo Graph Convolutional Network for Skeleton-Based Action Recognition
Qian, Weiwen
Huang, Qian
Li, Chang
Chen, Zhongqi
Mao, Yingchi
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), (245-261):
[33] Pose Refinement Graph Convolutional Network for Skeleton-Based Action Recognition
Li, Shijie
Yi, Jinhui
Abu Farha, Yazan
Gall, Juergen
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 1028 - 1035
[34] Spatial adaptive graph convolutional network for skeleton-based action recognition
Qilin Zhu
Hongmin Deng
Applied Intelligence, 2023, 53 : 17796 - 17808
[35] Pyramidal Graph Convolutional Network for Skeleton-Based Human Action Recognition
Li, Fanjia
Zhu, Aichun
Liu, Zhongyu
Huo, Yu
Xu, Yonggang
Hua, Gang
IEEE SENSORS JOURNAL, 2021, 21 (14) : 16183 - 16191
[36] Channel attention and multi-scale graph neural networks for skeleton-based action recognition
Dang, Ronghao
Liu, Chengju
Liu, Ming
Chen, Qijun
AI COMMUNICATIONS, 2022, 35 (03) : 187 - 205
[37] Kernel Attention Based Multi-scale Adaptive Graph Convolutional Neural Network for Skeleton-Based
Liu, Yanan
Zhang, Hao
Xu, Dan
2021 IEEE 7TH INTERNATIONAL CONFERENCE ON VIRTUAL REALITY (ICVR 2021), 2021, : 96 - 103
[38] Cross-Scale Spatial Refinement Graph Convolutional Network for Skeleton-Based Action Recognition
Chengyuan Ke
Sheng Liu
Zhenghao Ke
Yuan Feng
Shengyong Chen
International Journal of Computational Intelligence Systems, 18 (1)
[39] Multi-stream ternary enhanced graph convolutional network for skeleton-based action recognition
Kong, Jun
Wang, Shengquan
Jiang, Min
Liu, TianShan
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (25): : 18487 - 18504
[40] Two Stream Multi-Attention Graph Convolutional Network for Skeleton-Based Action Recognition
Zhou, Huijian
Tian, Zhiqiang
Du, Shaoyi
ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 112 - 120

← 1 2 3 4 5 →