An analysis of Convolutional Long Short-Term Memory Recurrent Neural Networks for gesture recognition

Cited by: 164
Authors
Tsironi, Eleni [1]
Barros, Pablo [1]
Weber, Cornelius [1]
Wermter, Stefan [1]
Affiliations
[1] Univ Hamburg, Dept Comp Sci, Knowledge Technol WTM, Vogt Koelln Str 30, D-22527 Hamburg, Germany
Keywords
Gesture recognition; CNN; LSTM; CNN visualization
DOI
10.1016/j.neucom.2016.12.088
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this research, we analyze a Convolutional Long Short-Term Memory Recurrent Neural Network (CNNLSTM) in the context of gesture recognition. CNNLSTMs are able to successfully learn gestures of varying duration and complexity. For this reason, we analyze the architecture by presenting a qualitative evaluation of the model, based on the visualization of the internal representations of the convolutional layers and on the examination of the temporal classification outputs at a frame level, in order to check if they match the cognitive perception of a gesture. We show that CNNLSTM learns the temporal evolution of the gestures classifying correctly their meaningful part, known as Kendon's stroke phase. With the visualization, for which we use the deconvolution process that maps specific feature map activations to original image pixels, we show that the network learns to detect the most intense body motion. Finally, we show that CNNLSTM outperforms both plain CNN and LSTM in gesture recognition. © 2017 The Authors. Published by Elsevier B.V.
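The data flow the abstract describes (convolutional features per frame, an LSTM over the frame sequence, and a classification output at every frame) can be illustrated with a minimal pure-Python sketch. This is not the authors' model: the layer sizes, the global max pooling, the random untrained weights, and every function name below are assumptions chosen only to keep the example small and self-contained.

```python
import math
import random

def conv2d_valid(image, kernel):
    """Valid 2D cross-correlation of a 2D list with a 2D kernel, followed by ReLU."""
    kh, kw = len(kernel), len(kernel[0])
    out = []
    for i in range(len(image) - kh + 1):
        row = []
        for j in range(len(image[0]) - kw + 1):
            s = sum(image[i + a][j + b] * kernel[a][b]
                    for a in range(kh) for b in range(kw))
            row.append(max(0.0, s))
        out.append(row)
    return out

def frame_features(image, kernels):
    """One feature per kernel: global max over its ReLU feature map (a simplification)."""
    return [max(max(r) for r in conv2d_valid(image, k)) for k in kernels]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def lstm_step(x, h, c, W):
    """One LSTM step. W[gate] holds one weight row per hidden unit over [x, h, bias]."""
    z = x + h + [1.0]
    def gate(name, act):
        return [act(sum(w * v for w, v in zip(row, z))) for row in W[name]]
    i, f, o = gate("i", sigmoid), gate("f", sigmoid), gate("o", sigmoid)
    g = gate("g", math.tanh)
    c_new = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
    h_new = [ok * math.tanh(ck) for ok, ck in zip(o, c_new)]
    return h_new, c_new

def softmax(v):
    m = max(v)
    e = [math.exp(x - m) for x in v]
    s = sum(e)
    return [x / s for x in e]

def cnn_lstm_forward(frames, kernels, W, W_out):
    """Return one class-probability vector per frame (frame-level outputs)."""
    hidden = len(W["i"])
    h, c = [0.0] * hidden, [0.0] * hidden
    outputs = []
    for frame in frames:
        x = frame_features(frame, kernels)   # convolutional stage, per frame
        h, c = lstm_step(x, h, c, W)         # recurrent stage, across frames
        logits = [sum(w * v for w, v in zip(row, h)) for row in W_out]
        outputs.append(softmax(logits))
    return outputs

def rand_matrix(rng, rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

# Hypothetical toy sizes and random (untrained) weights, for illustration only.
rng = random.Random(0)
n_kernels, hidden, n_classes = 3, 4, 2
kernels = [rand_matrix(rng, 3, 3) for _ in range(n_kernels)]
W = {g: rand_matrix(rng, hidden, n_kernels + hidden + 1) for g in "ifog"}
W_out = rand_matrix(rng, n_classes, hidden)
frames = [rand_matrix(rng, 6, 6) for _ in range(5)]  # 5 frames of 6x6 "pixels"

frame_probs = cnn_lstm_forward(frames, kernels, W, W_out)
print(len(frame_probs), [round(sum(p), 6) for p in frame_probs])
```

Because the LSTM state is carried from frame to frame, each per-frame output depends on all earlier frames. In a trained network, that is the mechanism that would let a frame-level classification follow the temporal evolution of a gesture.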
Pages: 76-86
Number of pages: 11
Related papers
50 in total
  • [1] Gesture Recognition Using Wearable Sensors With Bi-Long Short-Term Memory Convolutional Neural Networks
    Nguyen-Trong, Khanh
    Vu, Hoai Nam
    Trung, Ngon Nguyen
    Pham, Cuong
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (13): 15065-15079
  • [2] Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition
    Li, Xiangang
    Wu, Xihong
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015: 3219-3223
  • [3] Research on gesture EMG recognition based on long short-term memory and convolutional neural network
    Chen, Sijia
    Luo, Zhizeng
    [J]. Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2021, 42 (02): 162-170
  • [4] Convolutional Neural Networks and Long Short-Term Memory for skeleton-based human activity and hand gesture recognition
    Nunez, Juan C.
    Cabido, Raul
    Pantrigo, Juan J.
    Montemayor, Antonio S.
    Velez, Jose F.
    [J]. PATTERN RECOGNITION, 2018, 76: 80-94
  • [5] Toward Transportation Mode Recognition Using Deep Convolutional and Long Short-Term Memory Recurrent Neural Networks
    Qin, Yanjun
    Luo, Haiyong
    Zhao, Fang
    Wang, Chenxing
    Wang, Jiaqi
    Zhang, Yuexia
    [J]. IEEE ACCESS, 2019, 7: 142353-142367
  • [6] A Comparative Review of Convolutional Neural Networks, Long Short-Term Memory, and Recurrent Neural Networks in Recommendation Systems
    Tyagi, Geetanjali
    Ray, Susmita
    [J]. ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843: 395-408
  • [7] Convolutional Grid Long Short-Term Memory Recurrent Neural Network for Automatic Speech Recognition
    Xue, Jiabin
    Zheng, Tieran
    Han, Jiqing
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V, 2019, 1143: 718-726
  • [8] BIDIRECTIONAL QUATERNION LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORKS FOR SPEECH RECOGNITION
    Parcollet, Titouan
    Morchid, Mohamed
    Linares, Georges
    De Mori, Renato
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019: 8519-8523
  • [9] Handwriting Recognition with Large Multidimensional Long Short-Term Memory Recurrent Neural Networks
    Voigtlaender, Paul
    Doetsch, Patrick
    Ney, Hermann
    [J]. PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016: 228-233
  • [10] Deep neural learning techniques with long short-term memory for gesture recognition
    Jain, Deepak Kumar
    Mahanti, Aniket
    Shamsolmoali, Pourya
    Manikandan, Ramachandran
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (20): 16073-16089