Convolutional Neural Networks and Long Short-Term Memory for skeleton-based human activity and hand gesture recognition

Cited by: 250
Authors
Nunez, Juan C. [1 ]
Cabido, Raul [1 ]
Pantrigo, Juan J. [2 ]
Montemayor, Antonio S. [3 ]
Velez, Jose F. [1 ,2 ,3 ]
Affiliations
[1] Univ Rey Juan Carlos, Madrid, Spain
[2] Univ Rey Juan Carlos, CAPO Res Grp, Madrid, Spain
[3] Univ Rey Juan Carlos, Comp Sci, Madrid, Spain
Keywords
Deep learning; Convolutional Neural Network; Recurrent neural network; Long Short-Term Memory; Human activity recognition; Hand gesture recognition; Real-time; POSE;
DOI
10.1016/j.patcog.2017.10.033
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we address human activity and hand gesture recognition problems using 3D data sequences obtained from full-body and hand skeletons, respectively. To this aim, we propose a deep learning-based approach for temporal 3D pose recognition problems based on a combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) recurrent network. We also present a two-stage training strategy that first focuses on CNN training and then adjusts the full method (CNN+LSTM). Experimental testing demonstrated that our training method obtains better results than a single-stage training strategy. Additionally, we propose a data augmentation method that has also been validated experimentally. Finally, we perform an extensive experimental study on publicly available data benchmarks. The results obtained show that the proposed approach reaches state-of-the-art performance when compared to the methods identified in the literature. The best results were obtained for small datasets, where the proposed data augmentation strategy has greater impact. © 2017 Elsevier Ltd. All rights reserved.
Pages: 80 - 94
Number of pages: 15
Related papers
50 in total
  • [21] Skeleton-Based Action Recognition With Gated Convolutional Neural Networks
    Cao, Congqi
    Lan, Cuiling
    Zhang, Yifan
    Zeng, Wenjun
    Lu, Hanqing
    Zhang, Yanning
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) : 3247 - 3257
  • [22] Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition
    Li, Xiangang
    Wu, Xihong
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3219 - 3223
  • [23] Cross-Individual Gesture Recognition Based on Long Short-Term Memory Networks
    Min, Huasong
    Chen, Ziming
    Fang, Bin
    Xia, Ziwei
    Song, Yixu
    Wang, Zongtao
    Zhou, Quan
    Sun, Fuchun
    Liu, Chunfang
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [24] Bidirectional Independently Recurrent Neural Network for Skeleton-based Hand Gesture Recognition
    Li, Shuai
    Zheng, Longfei
    Zhu, Ce
    Gao, Yanbo
    [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [25] A neural network based on SPD manifold learning for skeleton-based hand gesture recognition
    Nguyen, Xuan Son
    Brun, Luc
    Lezoray, Olivier
    Bougleux, Sebastien
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12028 - 12037
  • [26] Pixel Convolutional Networks for Skeleton-Based Human Action Recognition
    Change, Zhichao
    Wang, Jiangyun
    Han, Liang
    [J]. METHODS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, 2018, 946 : 513 - 523
  • [27] Deep neural learning techniques with long short-term memory for gesture recognition
    Jain, Deepak Kumar
    Mahanti, Aniket
    Shamsolmoali, Pourya
    Manikandan, Ramachandran
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (20) : 16073 - 16089
  • [28] Deep neural learning techniques with long short-term memory for gesture recognition
    Deepak Kumar Jain
    Aniket Mahanti
    Pourya Shamsolmoali
    Ramachandran Manikandan
    [J]. Neural Computing and Applications, 2020, 32 : 16073 - 16089
  • [29] Deep Convolutional Network with Long Short-Term Memory Layers for Dynamic Gesture Recognition
    Siriak, Rostyslav
    Skarga-Bandurova, Inna
    Boltov, Yehor
    [J]. PROCEEDINGS OF THE 2019 10TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS - TECHNOLOGY AND APPLICATIONS (IDAACS), VOL. 1, 2019, : 158 - 162
  • [30] GIS Partial Discharge Pattern Recognition Based on a Novel Convolutional Neural Networks and Long Short-Term Memory
    Liu, Tingliang
    Yan, Jing
    Wang, Yanxin
    Xu, Yifan
    Zhao, Yiming
    [J]. ENTROPY, 2021, 23 (06)