Finger Gesture Spotting from Long Sequences Based on Multi-Stream Recurrent Neural Networks

被引:7
|
作者
Benitez-Garcia, Gibran [1 ,3 ]
Haris, Muhammad [1 ]
Tsuda, Yoshiyuki [2 ]
Ukita, Norimichi [1 ]
机构
[1] Toyota Technol Inst, Nagoya, Aichi 4688511, Japan
[2] DENSO Corp, Kariya, Aichi 4488661, Japan
[3] Univ Electrocommun, Dept Informat, Chofu, Tokyo 1828585, Japan
关键词
gesture spotting; human-computer interaction; automotive user interfaces; in-vehicle sensors; recurrent neural networks;
D O I
10.3390/s20020528
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Gesture spotting is an essential task for recognizing finger gestures used to control in-car touchless interfaces. Automated methods to achieve this task require to detect video segments where gestures are observed, to discard natural behaviors of users' hands that may look as target gestures, and be able to work online. In this paper, we address these challenges with a recurrent neural architecture for online finger gesture spotting. We propose a multi-stream network merging hand and hand-location features, which help to discriminate target gestures from natural movements of the hand, since these may not happen in the same 3D spatial location. Our multi-stream recurrent neural network (RNN) recurrently learns semantic information, allowing to spot gestures online in long untrimmed video sequences. In order to validate our method, we collect a finger gesture dataset in an in-vehicle scenario of an autonomous car. 226 videos with more than 2100 continuous instances were captured with a depth sensor. On this dataset, our gesture spotting approach outperforms state-of-the-art methods with an improvement of about 10% and 15% of recall and precision, respectively. Furthermore, we demonstrated that by combining with an existing gesture classifier (a 3D Convolutional Neural Network), our proposal achieves better performance than previous hand gesture recognition methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Multi-stream extended Kalman filter training for static and dynamic neural networks
    Puskorius, GV
    Feldkamp, LA
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 2006 - 2011
  • [22] Multi-Stream Isolated Sign Language Recognition Based on Finger Features Derived from Pose Data
    Akdag, Ali
    Baykan, Omer Kaan
    ELECTRONICS, 2024, 13 (08)
  • [23] Multi-Stream Convolutional Neural Network-Based Wearable, Flexible Bionic Gesture Surface Muscle Feature Extraction and Recognition
    Liu, Wansu
    Lu, Biao
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [24] Background Knowledge Based Multi-Stream Neural Network for Text Classification
    Ren, Fuji
    Deng, Jiawen
    APPLIED SCIENCES-BASEL, 2018, 8 (12):
  • [25] Facial Beauty Prediction From Facial Parts Using Multi-Task and Multi-Stream Convolutional Neural Networks
    Vahdati, Elham
    Suen, Ching Y.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (12)
  • [26] Elderly fall detection based on multi-stream deep convolutional networks
    Chadia Khraief
    Faouzi Benzarti
    Hamid Amiri
    Multimedia Tools and Applications, 2020, 79 : 19537 - 19560
  • [27] Elderly fall detection based on multi-stream deep convolutional networks
    Khraief, Chadia
    Benzarti, Faouzi
    Amiri, Hamid
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (27-28) : 19537 - 19560
  • [28] Dynamic Gesture Recognition Using Surface EMG Signals Based on Multi-Stream Residual Network
    Yang, Zhiwen
    Jiang, Du
    Sun, Ying
    Tao, Bo
    Tong, Xiliang
    Jiang, Guozhang
    Xu, Manman
    Yun, Juntong
    Liu, Ying
    Chen, Baojia
    Kong, Jianyi
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2021, 9
  • [29] Improved Multi-Stream Convolutional Block Attention Module for sEMG-Based Gesture Recognition
    Wang, Shudi
    Huang, Li
    Jiang, Du
    Sun, Ying
    Jiang, Guozhang
    Li, Jun
    Zou, Cejing
    Fan, Hanwen
    Xie, Yuanmin
    Xiong, Hegen
    Chen, Baojia
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [30] Multi-Stream General and Graph-Based Deep Neural Networks for Skeleton-Based Sign Language Recognition
    Miah, Abu Saleh Musa
    Hasan, Md. Al Mehedi
    Jang, Si-Woong
    Lee, Hyoun-Sup
    Shin, Jungpil
    ELECTRONICS, 2023, 12 (13)