Direction-guided two-stream convolutional neural networks for skeleton-based action recognition

Cited by: 0

Authors:

Benyue Su

Peng Zhang

Manzhen Sun

Min Sheng

Affiliations:

[1] Anqing Normal University,Key Laboratory of Intelligent Perception and Computing of Anhui Province

[2] Anqing Normal University,School of Computer and Information

[3] Tongling University,School of Mathematics and Computer

[4] Anqing Normal University,School of Mathematics and Physics

Source:

Soft Computing | 2023 / Vol. 27

Keywords:

Action recognition; Skeleton data; Direction; Edge-level information; Motion information; Feature fusion;

DOI:

Not available

Chinese Library Classification (CLC) number:

Subject classification code:

Abstract:

In skeleton-based action recognition, treating skeleton data as pseudo-images and processing them with convolutional neural networks (CNNs) has proven effective. However, most existing CNN-based approaches model information only at the joint level and ignore the size and direction of the skeleton edges, which play an important role in action recognition, so these approaches may not be optimal. In addition, existing approaches rarely use the directionality of human motion to describe how an action varies over time, although doing so is more natural and reasonable for action sequence modeling. In this work, we propose a novel direction-guided two-stream convolutional neural network for skeleton-based action recognition. In the first stream, our model focuses on our defined edge-level information (including edge and edge_motion information) with directionality in the skeleton data to explore the spatiotemporal features of the action. In the second stream, since motion is directional, we define different skeleton edge directions and extract different motion information (including translation and rotation information) in each direction to better exploit the motion features of the action. We also propose a description of human motion as a combination of translation and rotation, and explore how the two are integrated. We conducted extensive experiments on two challenging datasets, NTU-RGB+D 60 and NTU-RGB+D 120, to verify the superiority of our proposed method over state-of-the-art methods. The experimental results demonstrate that the proposed direction-guided edge-level information and motion information complement each other for better action recognition.
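The abstract names edge, edge_motion, translation, and rotation features but does not give their formulas. The sketch below is one plausible reading, not the authors' definitions. It assumes an edge is the vector between two connected joints, edge_motion is its frame-to-frame difference, translation is a joint's frame-to-frame displacement, and rotation is the angle an edge turns through between frames. The three-joint chain, the `BONES` list, the `reverse` flag, and all function names are hypothetical.

```python
import math

# Hypothetical 3-joint chain: 0 = hip, 1 = knee, 2 = ankle (not the NTU layout).
BONES = [(0, 1), (1, 2)]  # (source joint, target joint), pointing away from the hip

# Two frames of toy data: the lower leg swings forward in the second frame.
SEQ = [
    [(0, 0, 0), (0, -1, 0), (0, -2, 0)],
    [(0, 0, 0), (0, -1, 0), (1, -1, 0)],
]


def sub(a, b):
    """Component-wise difference a - b of two 3D points."""
    return tuple(x - y for x, y in zip(a, b))


def edges(frame, bones, reverse=False):
    """Edge vectors for one frame; `reverse` flips every edge's direction."""
    pairs = [(t, s) for s, t in bones] if reverse else bones
    return [sub(frame[t], frame[s]) for s, t in pairs]


def edge_motion(seq, bones, reverse=False):
    """Assumed edge_motion: change of each edge vector between consecutive frames."""
    per_frame = [edges(f, bones, reverse) for f in seq]
    return [[sub(b, a) for a, b in zip(prev, nxt)]
            for prev, nxt in zip(per_frame, per_frame[1:])]


def translation(seq):
    """Assumed translation: displacement of each joint between consecutive frames."""
    return [[sub(b, a) for a, b in zip(prev, nxt)]
            for prev, nxt in zip(seq, seq[1:])]


def rotation_angle(u, v):
    """Angle in radians between two edge vectors (0.0 if either has zero length)."""
    nu = math.sqrt(sum(x * x for x in u))
    nv = math.sqrt(sum(x * x for x in v))
    if nu == 0 or nv == 0:
        return 0.0
    cos = sum(x * y for x, y in zip(u, v)) / (nu * nv)
    return math.acos(max(-1.0, min(1.0, cos)))  # clamp against rounding error
```

Calling `edges(SEQ[0], BONES, reverse=True)` gives the same bones pointing toward the hip. In a CNN pipeline these per-frame features would then be stacked into a pseudo-image, for example with frames on one axis and edges on the other, before being fed to each stream.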


Pages: 11833-11842

Page count: 9

