Real-time Hand Gesture Detection and Classification Using Convolutional Neural Networks

被引:109
|
作者
Koepueklue, Okan [1 ]
Gunduz, Ahmet [1 ]
Kose, Neslihan [2 ]
Rigoll, Gerhard [1 ]
机构
[1] Tech Univ Munich, Inst Human Machine Commun, Munich, Germany
[2] Intel Deutschland GmbH, Intel Labs Europe, Dependabil Res Lab, Feldkirchen, Germany
关键词
D O I
10.1109/fg.2019.8756576
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Real-time recognition of dynamic hand gestures from video streams is a challenging task since (i) there is no indication when a gesture starts and ends in the video, (ii) performed gestures should only be recognized once, and (iii) the entire architecture should be designed considering the memory and power budget. In this work, we address these challenges by proposing a hierarchical structure enabling offline-working convolutional neural network (CNN) architectures to operate online efficiently by using sliding window approach. The proposed architecture consists of two models: (1) A detector which is a lightweight CNN architecture to detect gestures and (2) a classifier which is a deep CNN to classify the detected gestures. In order to evaluate the single-time activations of the detected gestures, we propose to use Levenshtein distance as an evaluation metric since it can measure misclassifications, multiple detections, and missing detections at the same time. We evaluate our architecture on two publicly available datasets-EgoGesture and NVIDIA Dynamic Hand Gesture Datasets-which require temporal detection and classification of the performed hand gestures. ResNeXt-101 model, which is used as a classifier, achieves the state-of-the-art offline classification accuracy of 94.04% and 83.82% for depth modality on EgoGesture and NVIDIA benchmarks, respectively. In real-time detection and classification, we obtain considerable early detections while achieving performances close to offline operation. The codes and pretrained models used in this work are publicly available(1).
引用
收藏
页码:407 / 414
页数:8
相关论文
共 50 条
  • [1] Hand Gesture Detection with Convolutional Neural Networks
    Alashhab, Samer
    Gallego, Antonio-Javier
    Angel Lozano, Miguel
    [J]. DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2019, 800 : 45 - 52
  • [2] Real-time arrhythmia detection using convolutional neural networks
    Vu, Thong
    Petty, Tyler
    Yakut, Kemal
    Usman, Muhammad
    Xue, Wei
    Haas, Francis M.
    Hirsh, Robert A.
    Zhao, Xinghui
    [J]. FRONTIERS IN BIG DATA, 2023, 6
  • [3] Real-Time Analysis of Hand Gesture Recognition with Temporal Convolutional Networks
    Tsinganos, Panagiotis
    Jansen, Bart
    Cornelis, Jan
    Skodras, Athanassios
    [J]. SENSORS, 2022, 22 (05)
  • [4] Real-Time Pedestrian Detection Using Convolutional Neural Networks
    Kuang, Ping
    Ma, Tingsong
    Li, Fan
    Chen, Ziwei
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (11)
  • [5] Real-Time Grasp Detection Using Convolutional Neural Networks
    Redmon, Joseph
    Angelova, Anelia
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 1316 - 1322
  • [6] Real-Time classification of Plankton species using Convolutional Neural Networks
    Nandini, Tata Sai
    Swethaa, S.
    Bolem, Srinivas
    Dharani, G.
    Thangarasu, Sivasakthi
    [J]. OCEANS 2022, 2022,
  • [7] Intelligent and Real-Time Detection and Classification Algorithm for Recycled Materials Using Convolutional Neural Networks
    Ziouzios, Dimitris
    Baras, Nikolaos
    Balafas, Vasileios
    Dasygenis, Minas
    Stimoniaris, Adam
    [J]. RECYCLING, 2022, 7 (01)
  • [8] Real-Time Hand Gesture Recognition Using Fine-Tuned Convolutional Neural Network
    Sahoo, Jaya Prakash
    Prakash, Allam Jaya
    Plawiak, Pawel
    Samantray, Saunak
    [J]. SENSORS, 2022, 22 (03)
  • [9] Incorporating Stereo with Convolutional Neural Networks for Real-Time Fish Detection and Classification
    Wu, Zong-Yao
    Tseng, Shih-Lun
    Lin, Huei-Yung
    Chen, Hsin-Yi
    Tran Van Luan
    [J]. PROCEEDINGS OF THE IEEE 2019 9TH INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) ROBOTICS, AUTOMATION AND MECHATRONICS (RAM) (CIS & RAM 2019), 2019, : 83 - 88
  • [10] Real-Time Hand Detection using Convolutional Neural Networks for Costa Rican Sign Language Recognition
    Zamora-Mora, Juan
    Chacon-Rivas, Mario
    [J]. 2019 INTERNATIONAL CONFERENCE ON INCLUSIVE TECHNOLOGIES AND EDUCATION (CONTIE 2019), 2019, : 180 - 186