Self-Adaptive Graph With Nonlocal Attention Network for Skeleton-Based Action Recognition

Cited by: 3
Authors
Pang, Chen [1, 2]
Gao, Xingyu [3]
Chen, Zhenyu [4, 5]
Lyu, Lei [1, 2]
Affiliations
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[2] Shandong Normal Univ, Shandong Prov Key Lab Novel Distributed Comp Softw, Jinan 250358, Peoples R China
[3] Chinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China
[4] State Grid Corp China, Big Data Ctr, Beijing 100031, Peoples R China
[5] China Elect Power Res Inst, Beijing 100192, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Skeleton; Convolution; Feature extraction; Data models; Correlation; Convolutional neural networks; Adaptation models; Action recognition; global attention; graph convolutional network (GCN); self-adaptive graph
DOI
10.1109/TNNLS.2023.3298950
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Graph convolutional networks (GCNs) have achieved encouraging progress in modeling human body skeletons as spatial-temporal graphs. However, existing methods still suffer from two inherent drawbacks. First, these models process the input data according to the physical structure of the human body, so some latent correlations among joints are ignored. Second, the key temporal relationships between nonadjacent frames are overlooked, which prevents the models from fully learning how the body joints change along the temporal dimension. To address these issues, we propose a spatial-temporal model that combines a self-adaptive GCN (SAGCN) with a global attention network, collectively termed SAGGAN. Specifically, the SAGCN module constructs two additional dynamic topological graphs: one learns the characteristics common to all data, and the other represents a pattern unique to each sample. Meanwhile, the global attention module, which consists of spatial attention (SA) and temporal attention (TA) modules, extracts the global connections between different joints in a single frame and models the temporal relationships between both adjacent and nonadjacent frames in a sequence. In this manner, our network captures richer action features for accurate action recognition and overcomes this limitation of standard graph convolution. Extensive experiments on three benchmark datasets (NTU-60, NTU-120, and Kinetics) demonstrate the superiority of the proposed method.
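The two ideas in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract gives no formulas, so the sketch assumes a common formulation in which the physical adjacency is summed with a shared learned matrix and a per-sample similarity matrix, and in which temporal attention is plain scaled dot-product attention over all frames. Every name here (`adaptive_graph_conv`, `temporal_attention`, `B_shared`, the weight matrices, the 5-joint toy skeleton) is hypothetical, and the random weights stand in for parameters that would be learned.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def normalize_adjacency(A):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} of the skeleton graph."""
    A_hat = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    return A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def adaptive_graph_conv(x, A_phys, B_shared, W_theta, W_phi, W_out):
    """Graph convolution over three summed graphs (assumed formulation).

    x: (T, V, C) features of one sample (T frames, V joints, C channels).
    A_phys:   (V, V) fixed graph from the physical skeleton.
    B_shared: (V, V) graph shared by all samples (would be learned).
    The third graph is computed from x itself, so it differs per sample.
    """
    theta = x @ W_theta                                   # (T, V, E)
    phi = x @ W_phi                                       # (T, V, E)
    C_sample = softmax(theta @ phi.transpose(0, 2, 1))    # (T, V, V)
    A_total = A_phys[None] + B_shared[None] + C_sample    # (T, V, V)
    return A_total @ x @ W_out                            # (T, V, C_out)

def temporal_attention(x, W_q, W_k):
    """Scaled dot-product attention across all frames, adjacent or not."""
    T = x.shape[0]
    flat = x.reshape(T, -1)                               # (T, V*C)
    q, k = flat @ W_q, flat @ W_k                         # (T, E) each
    attn = softmax(q @ k.T / np.sqrt(q.shape[1]))         # (T, T)
    return (attn @ flat).reshape(x.shape), attn

# Toy setup (hypothetical): a 5-joint chain, 4 frames, 3 channels per joint.
rng = np.random.default_rng(0)
T, V, C, E, C_out = 4, 5, 3, 4, 8
A = np.zeros((V, V))
for i in range(V - 1):
    A[i, i + 1] = A[i + 1, i] = 1.0

x = rng.normal(size=(T, V, C))
out = adaptive_graph_conv(
    x,
    normalize_adjacency(A),
    0.01 * rng.normal(size=(V, V)),   # stand-in for the shared learned graph
    rng.normal(size=(C, E)),
    rng.normal(size=(C, E)),
    rng.normal(size=(C, C_out)),
)
y, attn = temporal_attention(
    x, rng.normal(size=(V * C, E)), rng.normal(size=(V * C, E))
)
```

Each row of `attn` weights every frame in the sequence, which is how a frame can draw on nonadjacent frames as well as its neighbors; a fixed temporal convolution kernel only sees a local window.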
Pages: 1 / 13
Number of pages: 13