A GCN and Transformer complementary network for skeleton-based action recognition

被引：0

作者：

Xiang, Xuezhi ^{[1
,2
]}

Li, Xiaoheng ^{[1
]}

Liu, Xuzhao ^{[1
]}

Qiao, Yulong ^{[1
,2
]}

El Saddik, Abdulmotaleb ^{[3
]}

机构：

[1] School of Information and Communication Engineering, Harbin Engineering University, Harbin,150001, China

[2] Key Laboratory of Advanced Marine Communication and Information Technology, Ministry of Industry and Information Technology, Harbin,150001, China

[3] School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa,ON K1N 6N5, Canada

来源：

Computer Vision and Image Understanding | 2024年 / 249卷

基金：

中国国家自然科学基金;

关键词：

Joints; (anatomy);

D O I：

10.1016/j.cviu.2024.104213

中图分类号：

学科分类号：

摘要：

Graph Convolution Networks (GCNs) have been widely used in skeleton-based action recognition. Although there are significant progress, the inherent limitation still lies in the restricted receptive field of GCN, hindering its ability to extract global dependencies effectively. And the joints that are structurally separated can also have strong correlation. Previous works rarely explore local and global correlations of joints, leading to insufficiently model the complex dynamics of skeleton sequences. To address this issue, we propose a GCN and Transformer complementary network (GTC-Net) that allows parallel communications between GCN and Transformer domains. Specifically, we introduce a graph convolution and self-attention combined module (GAM), which can effectively leverage the complementarity of GCN and self-attention to perceive local and global dependencies of joints for the human body. Furthermore, in order to address the problems of long-term sequence ordering and position detection, we design a position-aware module (PAM), which can explicitly capture the ordering information and unique identity information for body joints of skeleton sequence. Extensive experiments on NTU RGB+D 60 and NTU RGB+D 120 datasets are conducted to evaluate our proposed method. The results demonstrate that our method can achieve competitive results on both datasets. © 2024 Elsevier Inc.

引用

共 50 条

[41] JOINTS RELATION INFERENCE NETWORK FOR SKELETON-BASED ACTION RECOGNITION
Ye, Fanfan
Tang, Huiming
Wang, Xuwen
Liang, Xiao
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 16 - 20
[42] Sequence Segmentation Attention Network for Skeleton-Based Action Recognition
Zhang, Yujie
Cai, Haibin
ELECTRONICS, 2023, 12 (07)
[43] Feedback Graph Convolutional Network for Skeleton-Based Action Recognition
Yang, Hao
Yan, Dan
Zhang, Li
Sun, Yunda
Li, Dong
Maybank, Stephen J.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 164 - 175
[44] Symmetrical Enhanced Fusion Network for Skeleton-Based Action Recognition
Kong, Jun
Deng, Haoyang
Jiang, Min
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (11) : 4394 - 4408
[45] Hierarchical Graph Convolutional Network for Skeleton-Based Action Recognition
Huang, Linjiang
Huang, Yan
Ouyang, Wanli
Wang, Liang
IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 93 - 102
[46] Revisiting Skeleton-based Action Recognition
Duan, Haodong
Zhao, Yue
Chen, Kai
Lin, Dahua
Dai, Bo
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2959 - 2968
[47] Skeleton-based action recognition based on multidimensional adaptive convolutional network
Xia, Yu
Gao, Qingyuan
Wu, Weiguan
Cao, Yi
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
[48] Hybrid features for skeleton-based action recognition based on network fusion
Chen, Zhangmeng
Pan, Junjun
Yang, Xiaosong
Qin, Hong
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2020, 31 (4-5)
[49] STSD: spatial–temporal semantic decomposition transformer for skeleton-based action recognition
Hu Cui
Tessai Hayama
Multimedia Systems, 2024, 30
[50] TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition
Liu, Haowei
Liu, Yongcheng
Chen, Yuxin
Yuan, Chunfeng
Li, Bing
Hu, Weiming
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 4137 - 4148

← 1 2 3 4 5 →