Skeleton-Based Action Recognition With Multi-Stream Adaptive Graph Convolutional Networks

被引:292
|
作者
Shi, Lei [1 ,2 ,3 ]
Zhang, Yifan [1 ,2 ,3 ]
Cheng, Jian [1 ,2 ,3 ]
Lu, Hanqing [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, NLPR, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Inst Automat, AIRIA, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptation models; Joints; Data models; Bones; Spatiotemporal phenomena; Task analysis; Skeleton-based action recognition; graph convolutional network; adaptive graph; multi-stream network;
D O I
10.1109/TIP.2020.3028207
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph convolutional networks (GCNs), which generalize CNNs to more generic non-Euclidean structures, have achieved remarkable performance for skeleton-based action recognition. However, there still exist several issues in the previous GCN-based models. First, the topology of the graph is set heuristically and fixed over all the model layers and input data. This may not be suitable for the hierarchy of the GCN model and the diversity of the data in action recognition tasks. Second, the second-order information of the skeleton data, i.e., the length and orientation of the bones, is rarely investigated, which is naturally more informative and discriminative for the human action recognition. In this work, we propose a novel multi-stream attention-enhanced adaptive graph convolutional neural network (MS-AAGCN) for skeleton-based action recognition. The graph topology in our model can be either uniformly or individually learned based on the input data in an end-to-end manner. This data-driven approach increases the flexibility of the model for graph construction and brings more generality to adapt to various data samples. Besides, the proposed adaptive graph convolutional layer is further enhanced by a spatial-temporal-channel attention module, which helps the model pay more attention to important joints, frames and features. Moreover, the information of both the joints and bones, together with their motion information, are simultaneously modeled in a multi-stream framework, which shows notable improvement for the recognition accuracy. Extensive experiments on the two large-scale datasets, NTU-RGBD and Kinetics-Skeleton, demonstrate that the performance of our model exceeds the state-of-the-art with a significant margin.
引用
收藏
页码:9532 / 9545
页数:14
相关论文
共 50 条
  • [41] Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition
    Lee, Jungho
    Lee, Minhyeok
    Lee, Dogyoon
    Lee, Sangyoun
    [J]. arXiv, 2022,
  • [42] Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition
    Lee, Jungho
    Lee, Minhyeok
    Lee, Dogyoon
    Lee, Sangyoun
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10410 - 10419
  • [43] Skeleton-Based Action Recognition With Low-Level Features of Adaptive Graph Convolutional Networks
    Gang, Jialin
    Xiao, Yao
    Liu, Shenglan
    Lu, Yao
    [J]. IEEE ACCESS, 2021, 9 : 127010 - 127018
  • [44] Structure-Feature Fusion Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition
    Zhang, Zhitao
    Wang, Zhengyou
    Zhuang, Shanna
    Huang, Fuyu
    [J]. IEEE ACCESS, 2020, 8 : 228108 - 228117
  • [45] Multi-Stream General and Graph-Based Deep Neural Networks for Skeleton-Based Sign Language Recognition
    Miah, Abu Saleh Musa
    Hasan, Md. Al Mehedi
    Jang, Si-Woong
    Lee, Hyoun-Sup
    Shin, Jungpil
    [J]. ELECTRONICS, 2023, 12 (13)
  • [46] Intra-Inter Region Adaptive Graph Convolutional Networks for skeleton-based action recognition
    Xu, Wenting
    Wang, Chuanxu
    Zhang, Zhe
    Lin, Guocheng
    Sun, Yue
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [47] Multi-Scale Adaptive Aggregate Graph Convolutional Network for Skeleton-Based Action Recognition
    Zheng, Zhiyun
    Wang, Yizhou
    Zhang, Xingjin
    Wang, Junfeng
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [48] Graph Convolutional Networks Skeleton-based Action Recognition for Continuous Data Stream: A Sliding Window Approach
    Delamare, Mickael
    Laville, Cyril
    Cabani, Adnane
    Chafouk, Houcine
    [J]. VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 427 - 435
  • [49] Multi-scale sampling attention graph convolutional networks for skeleton-based action recognition
    Tian, Haoyu
    Zhang, Yipeng
    Wu, Hanbo
    Ma, Xin
    Li, Yibin
    [J]. NEUROCOMPUTING, 2024, 597
  • [50] Pose-Guided Graph Convolutional Networks for Skeleton-Based Action Recognition
    Chen, Han
    Jiang, Yifan
    Ko, Hanseok
    [J]. IEEE ACCESS, 2022, 10 : 111725 - 111731