Direction-guided two-stream convolutional neural networks for skeleton-based action recognition

被引:0
|
作者
Benyue Su
Peng Zhang
Manzhen Sun
Min Sheng
机构
[1] Anqing Normal University,Key Laboratory of Intelligent Perception and Computing of Anhui Province
[2] Anqing Normal University,School of Computer and Information
[3] Tongling University,School of Mathematics and Computer
[4] Anqing Normal University,School of Mathematics and Physics
来源
Soft Computing | 2023年 / 27卷
关键词
Action recognition; Skeleton data; Direction; Edge-level information; Motion information; Feature fusion;
D O I
暂无
中图分类号
学科分类号
摘要
In skeleton-based action recognition, treating skeleton data as pseudoimages using convolutional neural networks (CNNs) has proven to be effective. However, among existing CNN-based approaches, most focus on modeling information at the joint-level ignoring the size and direction information of the skeleton edges, which play an important role in action recognition, and these approaches may not be optimal. In addition, combining the directionality of human motion to portray action motion variation information is rarely considered in existing approaches, although it is more natural and reasonable for action sequence modeling. In this work, we propose a novel direction-guided two-stream convolutional neural network for skeleton-based action recognition. In the first stream, our model focuses on our defined edge-level information (including edge and edge_motion information) with directionality in the skeleton data to explore the spatiotemporal features of the action. In the second stream, since the motion is directional, we define different skeleton edge directions and extract different motion information (including translation and rotation information) in different directions to better exploit the motion features of the action. In addition, we propose a description of human motion inscribed by a combination of translation and rotation, and explore how they are integrated. We conducted extensive experiments on two challenging datasets, the NTU-RGB+D 60 and NTU-RGB+D 120 datasets, to verify the superiority of our proposed method over state-of-the-art methods. The experimental results demonstrate that the proposed direction-guided edge-level information and motion information complement each other for better action recognition.
引用
收藏
页码:11833 / 11842
页数:9
相关论文
共 50 条
  • [31] Two-Stream Convolutional Neural Network for Video Action Recognition
    Qiao, Han
    Liu, Shuang
    Xu, Qingzhen
    Liu, Shouqiang
    Yang, Wanggan
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (10): : 3668 - 3684
  • [32] Action Tree Convolutional Networks: Skeleton-Based Human Action Recognition
    Liu, Wenjie
    Zhang, Ziyi
    Han, Bing
    Zhu, Chenhui
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 783 - 792
  • [33] Selective Hypergraph Convolutional Networks for Skeleton-based Action Recognition
    Zhu, Yiran
    Huang, Guangji
    Xu, Xing
    Ji, Yanli
    Shen, Fumin
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 518 - 526
  • [34] Recurrent graph convolutional networks for skeleton-based action recognition
    Zhu, Guangming
    Yang, Lu
    Zhang, Liang
    Shen, Peiyi
    Song, Juan
    Proceedings - International Conference on Pattern Recognition, 2020, : 1352 - 1359
  • [35] Recurrent Graph Convolutional Networks for Skeleton-based Action Recognition
    Zhu, Guangming
    Yang, Lu
    Zhang, Liang
    Shen, Peiyi
    Song, Juan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1352 - 1359
  • [36] Pixel Convolutional Networks for Skeleton-Based Human Action Recognition
    Change, Zhichao
    Wang, Jiangyun
    Han, Liang
    METHODS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, 2018, 946 : 513 - 523
  • [37] Two-Stream Convolutional Neural Networks for Emergency Recognition in Images
    Chen, Jia
    Duan, Shihui
    Long, Fei
    Wang, Yongxing
    Wang, Song
    Ling, Qiang
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 6470 - 6474
  • [38] Fourier analysis on robustness of graph convolutional neural networks for skeleton-based action recognition
    Tanaka, Nariki
    Kera, Hiroshi
    Kawamoto, Kazuhiko
    Computer Vision and Image Understanding, 2024, 240
  • [39] Fourier analysis on robustness of graph convolutional neural networks for skeleton-based action recognition
    Tanaka, Nariki
    Kera, Hiroshi
    Kawamoto, Kazuhiko
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
  • [40] Two Stream Multi-Attention Graph Convolutional Network for Skeleton-Based Action Recognition
    Zhou, Huijian
    Tian, Zhiqiang
    Du, Shaoyi
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 112 - 120