Multimodal Gesture Recognition Using Multi-stream Recurrent Neural Network

Cited by: 36
Authors
Nishida, Noriki [1]
Nakayama, Hideki [1]
Affiliation
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Machine Percept Grp, Tokyo, Japan
Source
Keywords
Multimodal gesture recognition; Recurrent neural networks; Long short-term memory; Convolutional neural networks
DOI
10.1007/978-3-319-29451-3_54
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present a novel method for multimodal gesture recognition based on neural networks. Our multi-stream recurrent neural network (MRNN) is a completely data-driven model that can be trained end to end without domain-specific hand engineering. The MRNN extends recurrent neural networks with Long Short-Term Memory cells (LSTM-RNNs), which facilitate the handling of variable-length gestures. We propose a recurrent approach for fusing multiple temporal modalities using multiple streams of LSTM-RNNs. In addition, we propose alternative fusion architectures and empirically evaluate the performance and robustness of these fusion strategies. Experimental results demonstrate that the proposed MRNN outperforms other state-of-the-art methods on the Sheffield Kinect Gesture (SKIG) dataset and is highly robust to noisy inputs.
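The idea of fusing several temporal modalities through multiple LSTM streams can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify layer sizes, modalities, or the exact fusion wiring, so everything below is an assumption made for illustration. In particular, the names `LSTMCell`, `multi_stream_forward`, `random_gesture`, the feature dimensions, the class count, and the choice to feed the concatenated stream outputs into a further "fusion" LSTM at every frame are all hypothetical. The weights are random and untrained; the sketch only shows how variable-length, multi-stream input could flow through such a model.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    """Multiply matrix W (a list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


class LSTMCell:
    """A single LSTM cell with input, forget, and output gates."""

    def __init__(self, n_in, n_hid, rng):
        self.n_hid = n_hid
        # One stacked weight matrix for the four gate pre-activations,
        # applied to the concatenation [x; h_prev].
        self.W = [[rng.uniform(-0.1, 0.1) for _ in range(n_in + n_hid)]
                  for _ in range(4 * n_hid)]
        self.b = [0.0] * (4 * n_hid)

    def step(self, x, h, c):
        n = self.n_hid
        z = [a + b for a, b in zip(matvec(self.W, x + h), self.b)]
        i = [sigmoid(v) for v in z[:n]]            # input gate
        f = [sigmoid(v) for v in z[n:2 * n]]       # forget gate
        o = [sigmoid(v) for v in z[2 * n:3 * n]]   # output gate
        g = [math.tanh(v) for v in z[3 * n:]]      # candidate cell update
        c_new = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, g)]
        h_new = [ov * math.tanh(cv) for ov, cv in zip(o, c_new)]
        return h_new, c_new


def multi_stream_forward(streams, stream_cells, fusion_cell, W_out):
    """streams[k][t] is the feature vector of modality k at frame t."""
    states = [([0.0] * cell.n_hid, [0.0] * cell.n_hid) for cell in stream_cells]
    h_f = [0.0] * fusion_cell.n_hid
    c_f = [0.0] * fusion_cell.n_hid
    for t in range(len(streams[0])):
        fused_input = []
        # Each modality advances its own LSTM stream by one frame.
        for k, cell in enumerate(stream_cells):
            h, c = cell.step(streams[k][t], *states[k])
            states[k] = (h, c)
            fused_input.extend(h)
        # Assumed fusion: a further LSTM consumes the concatenated outputs.
        h_f, c_f = fusion_cell.step(fused_input, h_f, c_f)
    # Classify the whole gesture from the final fusion state (softmax).
    logits = matvec(W_out, h_f)
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


rng = random.Random(0)
feature_dims = [4, 3]        # hypothetical per-modality feature sizes
hidden, n_classes = 5, 10    # hypothetical hidden size and class count
stream_cells = [LSTMCell(d, hidden, rng) for d in feature_dims]
fusion_cell = LSTMCell(hidden * len(feature_dims), hidden, rng)
W_out = [[rng.uniform(-0.1, 0.1) for _ in range(hidden)]
         for _ in range(n_classes)]


def random_gesture(n_frames):
    """Random stand-in for per-frame features of each modality."""
    return [[[rng.random() for _ in range(d)] for _ in range(n_frames)]
            for d in feature_dims]


# Gestures of different lengths pass through the same model unchanged.
for n_frames in (7, 12):
    probs = multi_stream_forward(random_gesture(n_frames), stream_cells,
                                 fusion_cell, W_out)
    print(n_frames, len(probs), round(sum(probs), 6))
```

Because the recurrent state carries information across frames, the same weights handle a 7-frame and a 12-frame gesture without padding or resampling, which is the property the abstract attributes to LSTM-RNNs. A practical model would replace the raw feature vectors with learned per-frame features and train all weights jointly.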
Pages: 682-694
Page count: 13