Direction-guided two-stream convolutional neural networks for skeleton-based action recognition

Authors
Benyue Su
Peng Zhang
Manzhen Sun
Min Sheng
Affiliations
[1] Anqing Normal University, Key Laboratory of Intelligent Perception and Computing of Anhui Province
[2] Anqing Normal University, School of Computer and Information
[3] Tongling University, School of Mathematics and Computer
[4] Anqing Normal University, School of Mathematics and Physics
Source
Soft Computing | 2023 / Vol. 27
Keywords
Action recognition; Skeleton data; Direction; Edge-level information; Motion information; Feature fusion
DOI
Not available
Abstract
In skeleton-based action recognition, treating skeleton data as pseudo-images and processing them with convolutional neural networks (CNNs) has proven effective. However, most existing CNN-based approaches model information only at the joint level and ignore the size and direction of the skeleton edges, which play an important role in action recognition, so these approaches may not be optimal. In addition, existing approaches rarely use the directionality of human motion to describe how an action varies over time, although doing so is more natural and reasonable for action sequence modeling. In this work, we propose a novel direction-guided two-stream convolutional neural network for skeleton-based action recognition. In the first stream, our model focuses on our defined edge-level information (edge and edge_motion information) with directionality in the skeleton data to explore the spatiotemporal features of the action. In the second stream, since motion is directional, we define different skeleton edge directions and extract different motion information (translation and rotation information) in each direction to better exploit the motion features of the action. We also propose a description of human motion as a combination of translation and rotation, and explore how the two are integrated. We conducted extensive experiments on two challenging datasets, NTU-RGB+D 60 and NTU-RGB+D 120, to compare the proposed method with state-of-the-art methods. The experimental results demonstrate that the proposed direction-guided edge-level information and motion information complement each other for better action recognition.
Pages: 11833-11842
Page count: 9
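
The abstract names four kinds of skeleton features: edge, edge_motion, translation, and rotation. It does not give their formulas, so the following is only a minimal NumPy sketch of one plausible reading, not the authors' implementation. Everything in it is an assumption made for illustration: the toy five-joint skeleton in EDGES, the function names, taking an edge as the target joint minus the source joint, taking translation as the displacement of the edge midpoint between frames, and taking rotation as the angle between the same edge in consecutive frames.

```python
import numpy as np

# Hypothetical toy skeleton with 5 joints (NOT the NTU-RGB+D joint layout).
# Each pair is (source_joint, target_joint); the order fixes the edge direction.
EDGES = [(0, 1), (1, 2), (2, 3), (1, 4)]


def edge_features(joints, edges=EDGES):
    """Directed edge vectors and their frame-to-frame change.

    joints: array of shape (T, J, 3), T frames of J 3D joints.
    Returns edge of shape (T, E, 3) and edge_motion of shape (T-1, E, 3).
    """
    src = [s for s, _ in edges]
    dst = [d for _, d in edges]
    # Assumed definition: edge = target joint minus source joint.
    edge = joints[:, dst] - joints[:, src]
    # Assumed definition: edge_motion = temporal difference of each edge.
    edge_motion = edge[1:] - edge[:-1]
    return edge, edge_motion


def motion_features(joints, edges=EDGES, eps=1e-8):
    """Assumed translation and rotation descriptors per edge.

    Returns translation of shape (T-1, E, 3) and rotation of shape (T-1, E),
    where rotation is an angle in radians.
    """
    src = [s for s, _ in edges]
    dst = [d for _, d in edges]
    edge, _ = edge_features(joints, edges)
    # Translation: displacement of the edge midpoint between frames.
    midpoint = 0.5 * (joints[:, src] + joints[:, dst])
    translation = midpoint[1:] - midpoint[:-1]
    # Rotation: angle between the same edge in consecutive frames.
    a, b = edge[:-1], edge[1:]
    norms = np.linalg.norm(a, axis=-1) * np.linalg.norm(b, axis=-1)
    cos = (a * b).sum(axis=-1) / np.maximum(norms, eps)
    rotation = np.arccos(np.clip(cos, -1.0, 1.0))
    return translation, rotation
```

In a CNN pipeline of the kind the abstract describes, arrays like these would typically be arranged as pseudo-images (for example, frames by edges by channels) before being fed to each stream. How the paper arranges and fuses them is not stated in this record.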