A GCN and Transformer complementary network for skeleton-based action recognition

被引:0
|
作者
Xiang, Xuezhi [1 ,2 ]
Li, Xiaoheng [1 ]
Liu, Xuzhao [1 ]
Qiao, Yulong [1 ,2 ]
El Saddik, Abdulmotaleb [3 ]
机构
[1] School of Information and Communication Engineering, Harbin Engineering University, Harbin,150001, China
[2] Key Laboratory of Advanced Marine Communication and Information Technology, Ministry of Industry and Information Technology, Harbin,150001, China
[3] School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa,ON K1N 6N5, Canada
基金
中国国家自然科学基金;
关键词
Joints; (anatomy);
D O I
10.1016/j.cviu.2024.104213
中图分类号
学科分类号
摘要
Graph Convolution Networks (GCNs) have been widely used in skeleton-based action recognition. Although there are significant progress, the inherent limitation still lies in the restricted receptive field of GCN, hindering its ability to extract global dependencies effectively. And the joints that are structurally separated can also have strong correlation. Previous works rarely explore local and global correlations of joints, leading to insufficiently model the complex dynamics of skeleton sequences. To address this issue, we propose a GCN and Transformer complementary network (GTC-Net) that allows parallel communications between GCN and Transformer domains. Specifically, we introduce a graph convolution and self-attention combined module (GAM), which can effectively leverage the complementarity of GCN and self-attention to perceive local and global dependencies of joints for the human body. Furthermore, in order to address the problems of long-term sequence ordering and position detection, we design a position-aware module (PAM), which can explicitly capture the ordering information and unique identity information for body joints of skeleton sequence. Extensive experiments on NTU RGB+D 60 and NTU RGB+D 120 datasets are conducted to evaluate our proposed method. The results demonstrate that our method can achieve competitive results on both datasets. © 2024 Elsevier Inc.
引用
收藏
相关论文
共 50 条
  • [41] JOINTS RELATION INFERENCE NETWORK FOR SKELETON-BASED ACTION RECOGNITION
    Ye, Fanfan
    Tang, Huiming
    Wang, Xuwen
    Liang, Xiao
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 16 - 20
  • [42] Sequence Segmentation Attention Network for Skeleton-Based Action Recognition
    Zhang, Yujie
    Cai, Haibin
    ELECTRONICS, 2023, 12 (07)
  • [43] Feedback Graph Convolutional Network for Skeleton-Based Action Recognition
    Yang, Hao
    Yan, Dan
    Zhang, Li
    Sun, Yunda
    Li, Dong
    Maybank, Stephen J.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 164 - 175
  • [44] Symmetrical Enhanced Fusion Network for Skeleton-Based Action Recognition
    Kong, Jun
    Deng, Haoyang
    Jiang, Min
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (11) : 4394 - 4408
  • [45] Hierarchical Graph Convolutional Network for Skeleton-Based Action Recognition
    Huang, Linjiang
    Huang, Yan
    Ouyang, Wanli
    Wang, Liang
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 93 - 102
  • [46] Revisiting Skeleton-based Action Recognition
    Duan, Haodong
    Zhao, Yue
    Chen, Kai
    Lin, Dahua
    Dai, Bo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2959 - 2968
  • [47] Skeleton-based action recognition based on multidimensional adaptive convolutional network
    Xia, Yu
    Gao, Qingyuan
    Wu, Weiguan
    Cao, Yi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [48] Hybrid features for skeleton-based action recognition based on network fusion
    Chen, Zhangmeng
    Pan, Junjun
    Yang, Xiaosong
    Qin, Hong
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2020, 31 (4-5)
  • [49] STSD: spatial–temporal semantic decomposition transformer for skeleton-based action recognition
    Hu Cui
    Tessai Hayama
    Multimedia Systems, 2024, 30
  • [50] TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition
    Liu, Haowei
    Liu, Yongcheng
    Chen, Yuxin
    Yuan, Chunfeng
    Li, Bing
    Hu, Weiming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 4137 - 4148