Real-time Hand Gesture Detection and Classification Using Convolutional Neural Networks

被引:109
|
作者
Koepueklue, Okan [1 ]
Gunduz, Ahmet [1 ]
Kose, Neslihan [2 ]
Rigoll, Gerhard [1 ]
机构
[1] Tech Univ Munich, Inst Human Machine Commun, Munich, Germany
[2] Intel Deutschland GmbH, Intel Labs Europe, Dependabil Res Lab, Feldkirchen, Germany
关键词
D O I
10.1109/fg.2019.8756576
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Real-time recognition of dynamic hand gestures from video streams is a challenging task since (i) there is no indication when a gesture starts and ends in the video, (ii) performed gestures should only be recognized once, and (iii) the entire architecture should be designed considering the memory and power budget. In this work, we address these challenges by proposing a hierarchical structure enabling offline-working convolutional neural network (CNN) architectures to operate online efficiently by using sliding window approach. The proposed architecture consists of two models: (1) A detector which is a lightweight CNN architecture to detect gestures and (2) a classifier which is a deep CNN to classify the detected gestures. In order to evaluate the single-time activations of the detected gestures, we propose to use Levenshtein distance as an evaluation metric since it can measure misclassifications, multiple detections, and missing detections at the same time. We evaluate our architecture on two publicly available datasets-EgoGesture and NVIDIA Dynamic Hand Gesture Datasets-which require temporal detection and classification of the performed hand gestures. ResNeXt-101 model, which is used as a classifier, achieves the state-of-the-art offline classification accuracy of 94.04% and 83.82% for depth modality on EgoGesture and NVIDIA benchmarks, respectively. In real-time detection and classification, we obtain considerable early detections while achieving performances close to offline operation. The codes and pretrained models used in this work are publicly available(1).
引用
收藏
页码:407 / 414
页数:8
相关论文
共 50 条
  • [31] Real-time vehicle type classification with deep convolutional neural networks
    Xinchen Wang
    Weiwei Zhang
    Xuncheng Wu
    Lingyun Xiao
    Yubin Qian
    Zhi Fang
    [J]. Journal of Real-Time Image Processing, 2019, 16 : 5 - 14
  • [32] Gesture Classification in Electromyography Signals for Real-Time Prosthetic Hand Control Using a Convolutional Neural Network-Enhanced Channel Attention Model
    Yu, Guangjie
    Deng, Ziting
    Bao, Zhenchen
    Zhang, Yue
    He, Bingwei
    Gallo, Crescenzio
    Zaza, Gianluca
    [J]. BIOENGINEERING-BASEL, 2023, 10 (11):
  • [33] A Real-Time Dynamic Gesture Variability Recognition Method Based on Convolutional Neural Networks
    Amangeldy, Nurzada
    Milosz, Marek
    Kudubayeva, Saule
    Kassymova, Akmaral
    Kalakova, Gulsim
    Zhetkenbay, Lena
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (19):
  • [34] Detection of Arrhythmia in Real-time using ECG Signal Analysis and Convolutional Neural Networks
    Reddy, Sashank
    Seshadri, Surabhi B.
    Bothra, G. Sankesh
    Suhas, T. G.
    Thundiyil, Saneesh Cleatus
    [J]. PROCEEDINGS OF 2020 IEEE 21ST INTERNATIONAL CONFERENCE ON COMPUTATIONAL PROBLEMS OF ELECTRICAL ENGINEERING (CPEE), 2020,
  • [35] Real-time license plate detection and recognition using deep convolutional neural networks
    Silva, Sergio Montazzolli
    Jung, Claudio Rosito
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71
  • [36] Real-Time and Continuous Hand Gesture Spotting: an Approach Based on Artificial Neural Networks
    Neto, Pedro
    Pereira, Dario
    Norberto Pires, J.
    Paulo Moreira, A.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 178 - 183
  • [37] Real-Time Hand Gesture Recognition Based on Electromyographic Signals and Artificial Neural Networks
    Motoche, Cristhian
    Benalcazar, Marco E.
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 352 - 361
  • [38] PIPELINE RUPTURE DETECTION USING REAL-TIME TRANSIENT MODELLING AND CONVOLUTIONAL NEURAL NETWORKS
    Smith, Joel
    Chae, Jaehee
    Learn, Shawn
    Hugo, Ron
    Park, Simon
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL PIPELINE CONFERENCE, 2018, VOL 3, 2018,
  • [39] Real-time fingertip localization conditioned on hand gesture classification
    Suau, Xavier
    Alcoverro, Marcel
    Lopez-Mendez, Adolfo
    Ruiz-Hidalgo, Javier
    Casas, Josep R.
    [J]. IMAGE AND VISION COMPUTING, 2014, 32 (08) : 522 - 532
  • [40] Real-Time Age Detection Using a Convolutional Neural Network
    Sithungu, Siphesihle
    Van der Haar, Dustin
    [J]. BUSINESS INFORMATION SYSTEMS, BIS 2019, PT II, 2019, 354 : 245 - 256