Finger Gesture Spotting from Long Sequences Based on Multi-Stream Recurrent Neural Networks

Cited by: 7
Authors
Benitez-Garcia, Gibran [1, 3]
Haris, Muhammad [1]
Tsuda, Yoshiyuki [2]
Ukita, Norimichi [1]
Affiliations
[1] Toyota Technol Inst, Nagoya, Aichi 4688511, Japan
[2] DENSO Corp, Kariya, Aichi 4488661, Japan
[3] Univ Electrocommun, Dept Informat, Chofu, Tokyo 1828585, Japan
Keywords
gesture spotting; human-computer interaction; automotive user interfaces; in-vehicle sensors; recurrent neural networks
DOI
10.3390/s20020528
Chinese Library Classification number
O65 [Analytical Chemistry]
Subject classification codes
070302; 081704
Abstract
Gesture spotting is an essential task for recognizing finger gestures used to control in-car touchless interfaces. Automated methods for this task must detect the video segments in which gestures occur, discard natural hand movements that may resemble target gestures, and work online. In this paper, we address these challenges with a recurrent neural architecture for online finger gesture spotting. We propose a multi-stream network that merges hand and hand-location features, which helps discriminate target gestures from natural movements of the hand, since the two may not occur in the same 3D spatial location. Our multi-stream recurrent neural network (RNN) recurrently learns semantic information, allowing gestures to be spotted online in long untrimmed video sequences. To validate our method, we collected a finger gesture dataset in an in-vehicle scenario of an autonomous car: 226 videos with more than 2100 continuous instances, captured with a depth sensor. On this dataset, our gesture spotting approach outperforms state-of-the-art methods, improving recall and precision by about 10% and 15%, respectively. Furthermore, we demonstrate that, when combined with an existing gesture classifier (a 3D convolutional neural network), our proposal achieves better performance than previous hand gesture recognition methods.
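As a rough illustration of what online spotting in an untrimmed sequence involves, the sketch below groups per-frame gesture scores into candidate segments as frames arrive. It is not the method of the paper: the function name `spot_segments`, the `threshold` and `min_length` parameters, and the example scores are all hypothetical, and the scores stand in for whatever a frame-level model (such as the multi-stream RNN) would output.

```python
from typing import Iterable, Iterator, Tuple


def spot_segments(
    scores: Iterable[float],
    threshold: float = 0.5,   # hypothetical decision threshold
    min_length: int = 3,      # hypothetical minimum segment length in frames
) -> Iterator[Tuple[int, int]]:
    """Yield (start, end) frame ranges, end exclusive, where scores stay >= threshold.

    Frames are consumed one at a time, so a segment is emitted as soon as
    the run of high scores ends, without looking at future frames.
    """
    start = None   # index where the current run began, or None if no run is open
    index = -1     # stays -1 if the stream is empty
    for index, score in enumerate(scores):
        if score >= threshold:
            if start is None:
                start = index          # a new candidate segment begins
        elif start is not None:
            if index - start >= min_length:
                yield (start, index)   # the run just ended and is long enough
            start = None               # short runs are dropped as noise
    # Close a run that is still open when the stream ends.
    if start is not None and (index + 1) - start >= min_length:
        yield (start, index + 1)


# Made-up per-frame scores for a short sequence.
example_scores = [0.1, 0.2, 0.9, 0.8, 0.7, 0.1, 0.9, 0.1, 0.6, 0.7, 0.9, 0.95]
segments = list(spot_segments(example_scores))
print(segments)  # [(2, 5), (8, 12)]
```

The single high-scoring frame at index 6 is discarded because it is shorter than `min_length`, which loosely mirrors the need to reject brief, gesture-like natural movements.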
Pages: 18