Direction-guided two-stream convolutional neural networks for skeleton-based action recognition

Cited by: 3
Authors
Su, Benyue [1, 3]
Zhang, Peng [1, 2]
Sun, Manzhen [1, 2]
Sheng, Min [4]
Affiliations
[1] Anqing Normal Univ, Key Lab Intelligent Percept & Comp Anhui Prov, Anqing 246133, Anhui, Peoples R China
[2] Anqing Normal Univ, Sch Comp & Informat, Anqing 246133, Anhui, Peoples R China
[3] Tongling Univ, Sch Math & Comp, Tongling 244061, Anhui, Peoples R China
[4] Anqing Normal Univ, Sch Math & Phys, Anqing 246133, Anhui, Peoples R China
Keywords
Action recognition; Skeleton data; Direction; Edge-level information; Motion information; Feature fusion
DOI
10.1007/s00500-023-07862-1
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In skeleton-based action recognition, treating skeleton data as pseudo-images and processing them with convolutional neural networks (CNNs) has proven effective. However, most existing CNN-based approaches model information at the joint level and ignore the size and direction of the skeleton edges, which play an important role in action recognition, so these approaches may not be optimal. In addition, existing approaches rarely use the directionality of human motion to describe the motion variation of an action, although doing so is more natural and reasonable for action sequence modeling. In this work, we propose a novel direction-guided two-stream convolutional neural network for skeleton-based action recognition. In the first stream, our model focuses on our defined edge-level information (including edge and edge_motion information) with directionality in the skeleton data to explore the spatiotemporal features of the action. In the second stream, since motion is directional, we define different skeleton edge directions and extract different motion information (including translation and rotation information) in different directions to better exploit the motion features of the action. In addition, we propose a description of human motion as a combination of translation and rotation, and explore how the two are integrated. We conducted extensive experiments on two challenging datasets, NTU-RGB+D 60 and NTU-RGB+D 120, to verify the superiority of our proposed method over state-of-the-art methods. The experimental results demonstrate that the proposed direction-guided edge-level information and motion information complement each other for better action recognition.
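As a rough illustration of the edge-level quantities named in the abstract, the sketch below computes directed edge vectors between connected joints and their frame-to-frame differences. It is only one plausible reading of "edge" and "edge_motion": the abstract does not give the paper's definitions, so the function names, the edge list, and the toy data are all assumptions, and the translation and rotation features of the second stream are not covered.

```python
from typing import List, Tuple

Vec = Tuple[float, float, float]
Frame = List[Vec]  # one 3D position per joint


def edge_vectors(frame: Frame, edges: List[Tuple[int, int]]) -> List[Vec]:
    """Directed edge vectors: target joint minus source joint (assumed definition)."""
    return [tuple(frame[j][k] - frame[i][k] for k in range(3)) for i, j in edges]


def temporal_difference(features: List[List[Vec]]) -> List[List[Vec]]:
    """Difference of each per-edge vector between consecutive frames."""
    return [
        [tuple(b[k] - a[k] for k in range(3)) for a, b in zip(prev, curr)]
        for prev, curr in zip(features, features[1:])
    ]


# Hypothetical toy skeleton: 3 joints, 2 directed edges, 2 frames.
EDGES = [(0, 1), (1, 2)]
clip = [
    [(0.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 2.0, 0.0)],
    [(0.0, 0.0, 0.0), (0.0, 1.0, 0.0), (1.0, 1.0, 0.0)],
]

edges_per_frame = [edge_vectors(frame, EDGES) for frame in clip]
edge_motion = temporal_difference(edges_per_frame)

print(edges_per_frame)
print(edge_motion)
```

Reversing an edge's joint order flips the sign of its vector, which is one simple way the chosen direction could change the features a network sees.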
Pages: 11833-11842
Number of pages: 10
Related papers
50 in total
  • [1] Direction-guided two-stream convolutional neural networks for skeleton-based action recognition
    Benyue Su
    Peng Zhang
    Manzhen Sun
    Min Sheng
    [J]. Soft Computing, 2023, 27: 11833-11842
  • [2] Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition
    Shi, Lei
    Zhang, Yifan
    Cheng, Jian
    Lu, Hanqing
    [J]. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), 2019: 12018-12027
  • [3] Two-Stream Temporal Convolutional Networks for Skeleton-Based Human Action Recognition
    Jin-Gong Jia
    Yuan-Feng Zhou
    Xing-Wei Hao
    Feng Li
    Christian Desrosiers
    Cai-Ming Zhang
    [J]. Journal of Computer Science and Technology, 2020, 35: 538-550
  • [4] Two-Stream Temporal Convolutional Networks for Skeleton-Based Human Action Recognition
    Jia, Jin-Gong
    Zhou, Yuan-Feng
    Hao, Xing-Wei
    Li, Feng
    Desrosiers, Christian
    Zhang, Cai-Ming
    [J]. Journal of Computer Science and Technology, 2020, 35 (03): 538-550
  • [5] Two-Stream Spatial Graphormer Networks for Skeleton-Based Action Recognition
    Li, Xiaolei
    Zhang, Junyou
    Wang, Shufeng
    Zhou, Qian
    [J]. IEEE Access, 2022, 10: 100426-100437
  • [6] Interactive two-stream graph neural network for skeleton-based action recognition
    Yang, Dun
    Zhou, Qing
    Wen, Ju
    [J]. Journal of Electronic Imaging, 2021, 30 (03)
  • [7] Beyond Two-stream: Skeleton-based Three-stream Networks for Action Recognition in Videos
    Xu, Jianfeng
    Tasaka, Kazuyuki
    Yanagihara, Hiromasa
    [J]. 2018 24th International Conference on Pattern Recognition (ICPR), 2018: 1567-1573
  • [8] Skeleton-Based Action Recognition with Convolutional Neural Networks
    Li, Chao
    Zhong, Qiaoyong
    Xie, Di
    Pu, Shiliang
    [J]. 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 2017
  • [9] 2s-GATCN: Two-Stream Graph Attentional Convolutional Networks for Skeleton-Based Action Recognition
    Zhou, Shu-Bo
    Chen, Ran-Ran
    Jiang, Xue-Qin
    Pan, Feng
    [J]. Electronics, 2023, 12 (07)
  • [10] Skeleton-based Action Recognition Using Two-stream Graph Convolutional Network with Pose Refinement
    Zheng, Biao
    Chen, Luefeng
    Wu, Min
    Pedrycz, Witold
    Hirota, Kaoru
    [J]. 2022 41st Chinese Control Conference (CCC), 2022: 6353-6356