Finger Gesture Spotting from Long Sequences Based on Multi-Stream Recurrent Neural Networks

Cited by: 7
Authors
Benitez-Garcia, Gibran [1 ,3 ]
Haris, Muhammad [1 ]
Tsuda, Yoshiyuki [2 ]
Ukita, Norimichi [1 ]
Affiliations
[1] Toyota Technol Inst, Nagoya, Aichi 4688511, Japan
[2] DENSO Corp, Kariya, Aichi 4488661, Japan
[3] Univ Electrocommun, Dept Informat, Chofu, Tokyo 1828585, Japan
Keywords
gesture spotting; human-computer interaction; automotive user interfaces; in-vehicle sensors; recurrent neural networks;
DOI
10.3390/s20020528
Chinese Library Classification
O65 [Analytical Chemistry];
Discipline classification codes
070302 ; 081704 ;
Abstract
Gesture spotting is an essential task for recognizing finger gestures used to control in-car touchless interfaces. Automated methods for this task must detect the video segments in which gestures occur, discard natural hand movements that may resemble target gestures, and work online. In this paper, we address these challenges with a recurrent neural architecture for online finger gesture spotting. We propose a multi-stream network that merges hand and hand-location features, which helps to discriminate target gestures from natural hand movements, since the two may not occur in the same 3D spatial location. Our multi-stream recurrent neural network (RNN) recurrently learns semantic information, allowing it to spot gestures online in long untrimmed video sequences. To validate our method, we collect a finger gesture dataset in an in-vehicle scenario of an autonomous car. The dataset comprises 226 videos with more than 2100 continuous gesture instances, captured with a depth sensor. On this dataset, our gesture spotting approach outperforms state-of-the-art methods, with improvements of about 10% in recall and 15% in precision. Furthermore, we demonstrate that, when combined with an existing gesture classifier (a 3D Convolutional Neural Network), our proposal achieves better performance than previous hand gesture recognition methods.
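To make the idea of merging two feature streams and spotting gestures frame by frame more concrete, the following is a minimal Python sketch. It is not the authors' network: the abstract does not specify the recurrent cell, the fusion scheme, or the feature dimensions, so the plain tanh recurrent cell, the fusion by concatenation, the random untrained weights, the 0.5 threshold, and the names `MultiStreamRNN` and `spot_segments` are all assumptions made for illustration only.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Small random weight matrix (stand-in for trained weights)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Matrix-vector product on plain Python lists."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class MultiStreamRNN:
    """Hypothetical two-stream recurrent spotter (illustrative, untrained)."""

    def __init__(self, hand_dim, loc_dim, hidden_dim, seed=0):
        rng = random.Random(seed)
        self.w_in = make_matrix(hidden_dim, hand_dim + loc_dim, rng)
        self.w_rec = make_matrix(hidden_dim, hidden_dim, rng)
        self.w_out = [rng.uniform(-0.1, 0.1) for _ in range(hidden_dim)]
        self.h = [0.0] * hidden_dim  # hidden state carried across frames

    def step(self, hand_feat, loc_feat):
        """Consume one frame and return a gesture probability in (0, 1)."""
        # Assumption: the two streams are fused by simple concatenation.
        x = list(hand_feat) + list(loc_feat)
        a = matvec(self.w_in, x)
        b = matvec(self.w_rec, self.h)
        self.h = [math.tanh(ai + bi) for ai, bi in zip(a, b)]
        logit = sum(w * h for w, h in zip(self.w_out, self.h))
        return 1.0 / (1.0 + math.exp(-logit))


def spot_segments(probs, threshold=0.5):
    """Group consecutive frames at or above the threshold into (start, end) segments."""
    segments = []
    start = None
    for t, p in enumerate(probs):
        if p >= threshold and start is None:
            start = t
        elif p < threshold and start is not None:
            segments.append((start, t - 1))
            start = None
    if start is not None:
        segments.append((start, len(probs) - 1))
    return segments
```

Because `step` only needs the current frame and the stored hidden state, the sketch can run online on an untrimmed sequence. In a real system the weights would be learned, and the per-frame features would come from the hand and hand-location streams described in the abstract.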
Pages: 18
Related papers
50 in total
  • [41] Perceptual-Based Playout Mechanisms for Multi-Stream Voice over IP Networks
    Wu, Chun-Feng
    Chang, Wen-Whei
    Chiang, Yuan-Chuan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (05): : 1018 - 1025
  • [42] Multi-stream mixed graph convolutional networks for skeleton-based action recognition
    Zhuang, Boyuan
    Kong, Jun
    Jiang, Min
    Liu, Tianshan
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (06)
  • [43] Perceptual-Based Playout Mechanisms for Multi-Stream Voice over IP Networks
    Wu, Chun-Feng
    Lee, Cheng-Lung
    Chang, Wen-Whei
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 205 - 208
  • [44] Multi-stream slowFast graph convolutional networks for skeleton-based action recognition
    Sun, Ning
    Leng, Ling
    Liu, Jixin
    Han, Guang
    IMAGE AND VISION COMPUTING, 2021, 109
  • [45] Skeleton-Based Action Recognition With Multi-Stream Adaptive Graph Convolutional Networks
    Shi, Lei
    Zhang, Yifan
    Cheng, Jian
    Lu, Hanqing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9532 - 9545
  • [46] Multi-Stream Convolutional Neural Networks for Rotating Machinery Fault Diagnosis under Noise and Trend Items
    Dong, Han
    Lu, Jiping
    Han, Yafeng
    SENSORS, 2022, 22 (07)
  • [47] Finger-Vein Verification Based on LSTM Recurrent Neural Networks
    Qin, Huafeng
    Wang, Peng
    APPLIED SCIENCES-BASEL, 2019, 9 (08):
  • [48] Human Fall Detection Using 3D Multi-Stream Convolutional Neural Networks with Fusion
    Alanazi, Thamer
    Muhammad, Ghulam
    DIAGNOSTICS, 2022, 12 (12)
  • [49] An evolving ensemble model of multi-stream convolutional neural networks for human action recognition in still images
    Slade, Sam
    Zhang, Li
    Yu, Yonghong
    Lim, Chee Peng
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (11): : 9205 - 9231
  • [50] Weighted voting of multi-stream convolutional neural networks for video-based action recognition using optical flow rhythms
    Brito, Andre de Souza
    Vieira, Marcelo Bernardes
    Villela, Saulo Moraes
    Tacon, Hemerson
    Chaves, Hugo de Lima
    Maia, Helena de Almeida
    Concha, Darwin Ttito
    Pedrini, Helio
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 77