Finger Gesture Spotting from Long Sequences Based on Multi-Stream Recurrent Neural Networks

被引：7

作者：

Benitez-Garcia, Gibran ^{[1
,3
]}

Haris, Muhammad ^{[1
]}

Tsuda, Yoshiyuki ^{[2
]}

Ukita, Norimichi ^{[1
]}

机构：

[1] Toyota Technol Inst, Nagoya, Aichi 4688511, Japan

[2] DENSO Corp, Kariya, Aichi 4488661, Japan

[3] Univ Electrocommun, Dept Informat, Chofu, Tokyo 1828585, Japan

来源：

SENSORS | 2020年 / 20卷 / 02期

关键词：

gesture spotting; human-computer interaction; automotive user interfaces; in-vehicle sensors; recurrent neural networks;

D O I：

10.3390/s20020528

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Gesture spotting is an essential task for recognizing finger gestures used to control in-car touchless interfaces. Automated methods to achieve this task require to detect video segments where gestures are observed, to discard natural behaviors of users' hands that may look as target gestures, and be able to work online. In this paper, we address these challenges with a recurrent neural architecture for online finger gesture spotting. We propose a multi-stream network merging hand and hand-location features, which help to discriminate target gestures from natural movements of the hand, since these may not happen in the same 3D spatial location. Our multi-stream recurrent neural network (RNN) recurrently learns semantic information, allowing to spot gestures online in long untrimmed video sequences. In order to validate our method, we collect a finger gesture dataset in an in-vehicle scenario of an autonomous car. 226 videos with more than 2100 continuous instances were captured with a depth sensor. On this dataset, our gesture spotting approach outperforms state-of-the-art methods with an improvement of about 10% and 15% of recall and precision, respectively. Furthermore, we demonstrated that by combining with an existing gesture classifier (a 3D Convolutional Neural Network), our proposal achieves better performance than previous hand gesture recognition methods.

引用

页数：18

共 50 条

[41] Perceptual-Based Playout Mechanisms for Multi-Stream Voice over IP Networks
Wu, Chun-Feng
Chang, Wen-Whei
Chiang, Yuan-Chuan
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (05): : 1018 - 1025
[42] Multi-stream mixed graph convolutional networks for skeleton-based action recognition
Zhuang, Boyuan
Kong, Jun
Jiang, Min
Liu, Tianshan
JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (06)
[43] Perceptual-Based Playout Mechanisms for Multi-Stream Voice over IP Networks
Wu, Chun-Feng
Lee, Cheng-Lung
Chang, Wen-Whei
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 205 - 208
[44] Multi-stream slowFast graph convolutional networks for skeleton-based action recognition
Sun, Ning
Leng, Ling
Liu, Jixin
Han, Guang
IMAGE AND VISION COMPUTING, 2021, 109
[45] Skeleton-Based Action Recognition With Multi-Stream Adaptive Graph Convolutional Networks
Shi, Lei
Zhang, Yifan
Cheng, Jian
Lu, Hanqing
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9532 - 9545
[46] Multi-Stream Convolutional Neural Networks for Rotating Machinery Fault Diagnosis under Noise and Trend Items
Dong, Han
Lu, Jiping
Han, Yafeng
SENSORS, 2022, 22 (07)
[47] Finger-Vein Verification Based on LSTM Recurrent Neural Networks
Qin, Huafeng
Wang, Peng
APPLIED SCIENCES-BASEL, 2019, 9 (08):
[48] Human Fall Detection Using 3D Multi-Stream Convolutional Neural Networks with Fusion
Alanazi, Thamer
Muhammad, Ghulam
DIAGNOSTICS, 2022, 12 (12)
[49] An evolving ensemble model of multi-stream convolutional neural networks for human action recognition in still images
Slade, Sam
Zhang, Li
Yu, Yonghong
Lim, Chee Peng
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (11): : 9205 - 9231
[50] Weighted voting of multi-stream convolutional neural networks for video-based action recognition using optical flow rhythms
Brito, Andre de Souza
Vieira, Marcelo Bernardes
Villela, Saulo Moraes
Tacon, Hemerson
Chaves, Hugo de Lima
Maia, Helena de Almeida
Concha, Darwin Ttito
Pedrini, Helio
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 77

← 1 2 3 4 5 →