Keys for Action: An Efficient Keyframe-Based Approach for 3D Action Recognition Using a Deep Neural Network

被引:28
|
作者
Yasin, Hashim [1 ]
Hussain, Mazhar [1 ]
Weber, Andreas [2 ]
机构
[1] Natl Univ Comp & Emerging Sci, Dept Comp Sci, Islamabad 44000, Pakistan
[2] Univ Bonn, Dept Comp Sci 2, D-53115 Bonn, Germany
关键词
action recognition; deep neural network (DNN); motion capture (MoCap) datasets; keyframe extraction; MOTION CAPTURE; SEQUENCE;
D O I
10.3390/s20082226
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this paper, we propose a novel and efficient framework for 3D action recognition using a deep learning architecture. First, we develop a 3D normalized pose space that consists of only 3D normalized poses, which are generated by discarding translation and orientation information. From these poses, we extract joint features and employ them further in a Deep Neural Network (DNN) in order to learn the action model. The architecture of our DNN consists of two hidden layers with the sigmoid activation function and an output layer with the softmax function. Furthermore, we propose a keyframe extraction methodology through which, from a motion sequence of 3D frames, we efficiently extract the keyframes that contribute substantially to the performance of the action. In this way, we eliminate redundant frames and reduce the length of the motion. More precisely, we ultimately summarize the motion sequence, while preserving the original motion semantics. We only consider the remaining essential informative frames in the process of action recognition, and the proposed pipeline is sufficiently fast and robust as a result. Finally, we evaluate our proposed framework intensively on publicly available benchmark Motion Capture (MoCap) datasets, namely HDM05 and CMU. From our experiments, we reveal that our proposed scheme significantly outperforms other state-of-the-art approaches.
引用
下载
收藏
页数:24
相关论文
共 50 条
  • [41] Graph-based approach for 3D human skeletal action recognition
    Li, Meng
    Leung, Howard
    PATTERN RECOGNITION LETTERS, 2017, 87 : 195 - 202
  • [42] Action recognition using 3D DAISY descriptor
    Xiaochun Cao
    Hua Zhang
    Chao Deng
    Qiguang Liu
    Hanyu Liu
    Machine Vision and Applications, 2014, 25 : 159 - 171
  • [43] Effective 3D action recognition using EigenJoints
    Yang, Xiaodong
    Tian, YingLi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (01) : 2 - 11
  • [44] Action recognition using 3D DAISY descriptor
    Cao, Xiaochun
    Zhang, Hua
    Deng, Chao
    Liu, Qiguang
    Liu, Hanyu
    MACHINE VISION AND APPLICATIONS, 2014, 25 (01) : 159 - 171
  • [45] 3D Face Recognition Method Based on Deep Convolutional Neural Network
    Feng, Jianying
    Guo, Qian
    Guan, Yudong
    Wu, Mengdie
    Zhang, Xingrui
    Ti, Chunli
    SMART INNOVATIONS IN COMMUNICATION AND COMPUTATIONAL SCIENCES, VOL 2, 2019, 670 : 123 - 130
  • [46] Deep spatiotemporal LSTM network with temporal pattern feature for 3D human action recognition
    Wu, Yirui
    Wei, Lianglei
    Duan, Yucong
    COMPUTATIONAL INTELLIGENCE, 2019, 35 (03) : 535 - 554
  • [47] Action Recognition Using Deep 3D CNNs with Sequential Feature Aggregation and Attention
    Anvarov, Fazliddin
    Kim, Dae Ha
    Song, Byung Cheol
    ELECTRONICS, 2020, 9 (01)
  • [48] Using Gabor Filter in 3D Convolutional Neural Networks for Human Action Recognition
    Li, Jiakun
    Wang, Tian
    Zhou, Yi
    Wang, Ziyu
    Snoussi, Hichem
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 11139 - 11144
  • [49] PointDMIG: a dynamic motion-informed graph neural network for 3D action recognition
    Du, Yao
    Hou, Zhenjie
    Li, Xing
    Liang, Jiuzhen
    You, Kaijun
    Zhou, Xinwen
    MULTIMEDIA SYSTEMS, 2024, 30 (04)
  • [50] Improving human action recognition with two-stream 3D convolutional neural network
    Van-Minh Khong
    Thanh-Hai Tran
    2018 1ST INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2018,