Multimodal Gesture Recognition Using Multi-stream Recurrent Neural Network

被引:36
|
作者
Nishida, Noriki [1 ]
Nakayama, Hideki [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Machine Percept Grp, Tokyo, Japan
来源
关键词
Multimodal gesture recognition; Recurrent neural networks; Long short-term memory; Convolutional neural networks;
D O I
10.1007/978-3-319-29451-3_54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel method for multimodal gesture recognition based on neural networks. Our multi-stream recurrent neural network (MRNN) is a completely data-driven model that can be trained from end to end without domain-specific hand engineering. The MRNN extends recurrent neural networks with Long Short-Term Memory cells (LSTM-RNNs) that facilitate the handling of variable-length gestures. We propose a recurrent approach for fusing multiple temporal modalities using multiple streams of LSTM-RNNs. In addition, we propose alternative fusion architectures and empirically evaluate the performance and robustness of these fusion strategies. Experimental results demonstrate that the proposed MRNN outperforms other state-of-the-art methods in the Sheffield Kinect Gesture (SKIG) dataset, and has significantly high robustness to noisy inputs.
引用
收藏
页码:682 / 694
页数:13
相关论文
共 50 条
  • [21] Stochastic Fusion for Multi-stream Neural Network in Video Classification
    Huang, Yu-Min
    Tseng, Huan-Hsin
    Chien, Jen-Tzung
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 69 - 74
  • [22] Multi-Task Recurrent Neural Network for Surgical Gesture Recognition and Progress Prediction
    van Amsterdam, Beatrice
    Clarkson, Matthew J.
    Stoyanov, Danail
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 1380 - 1386
  • [23] LSTM Recurrent Neural Network for Hand Gesture Recognition Using EMG Signals
    Toro-Ossaba, Alejandro
    Jaramillo-Tigreros, Juan
    Tejada, Juan C.
    Pena, Alejandro
    Lopez-Gonzalez, Alexandro
    Alexandre Castanho, Rui
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [24] A Multi-Stream Bi-Directional Recurrent Neural Network for Fine-Grained Action Detection
    Singh, Bharat
    Marks, Tim K.
    Jones, Michael
    Tuzel, Oncel
    Shao, Ming
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1961 - 1970
  • [25] Multi-scale multi-stream deep network for car logo recognition
    Snehal Surwase
    Meenakshi Pawar
    [J]. Evolutionary Intelligence, 2023, 16 : 485 - 492
  • [26] Multi-Stream Concept Drift Self-Adaptation Using Graph Neural Network
    Zhou, Ming
    Lu, Jie
    Song, Yiliao
    Zhang, Guangquan
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (12) : 12828 - 12841
  • [27] Multi-scale multi-stream deep network for car logo recognition
    Surwase, Snehal
    Pawar, Meenakshi
    [J]. EVOLUTIONARY INTELLIGENCE, 2023, 16 (02) : 485 - 492
  • [28] Fusing multi-stream deep neural networks for facial expression recognition
    Fatima Zahra Salmam
    Abdellah Madani
    Mohamed Kissi
    [J]. Signal, Image and Video Processing, 2019, 13 : 609 - 616
  • [29] Fusing multi-stream deep neural networks for facial expression recognition
    Zahra Salmam, Fatima
    Madani, Abdellah
    Kissi, Mohamed
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2019, 13 (03) : 609 - 616
  • [30] Multi-Stream Convolution-Recurrent Neural Networks Based on Attention Mechanism Fusion for Speech Emotion Recognition
    Tao, Huawei
    Geng, Lei
    Shan, Shuai
    Mai, Jingchao
    Fu, Hongliang
    [J]. ENTROPY, 2022, 24 (08)