Next-Gen Dynamic Hand Gesture Recognition: MediaPipe, Inception-v3 and LSTM-Based Enhanced Deep Learning Model

被引:1
|
作者
Kwon, Oh-Jin [1 ]
Kim, Jaeho [2 ]
Jamil, Sonain [3 ]
Lee, Jinhee [1 ]
Ullah, Faiz [1 ]
机构
[1] Sejong Univ, Dept Elect Engn, Seoul 05006, South Korea
[2] Sejong Univ, Dept Elect Engn, Seoul 05006, South Korea
[3] Norwegian Univ Sci & Technol NTNU, Dept Comp Sci, N-2815 Gjovik, Norway
基金
新加坡国家研究基金会;
关键词
dynamic hand gesture recognition; hybrid deep learning; dimensionality reduction; temporal data classification; transfer learning; vision-based drone control;
D O I
10.3390/electronics13163233
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gesture recognition is crucial in computer vision-based applications, such as drone control, gaming, virtual and augmented reality (VR/AR), and security, especially in human-computer interaction (HCI)-based systems. There are two types of gesture recognition systems, i.e., static and dynamic. However, our focus in this paper is on dynamic gesture recognition. In dynamic hand gesture recognition systems, the sequences of frames, i.e., temporal data, pose significant processing challenges and reduce efficiency compared to static gestures. These data become multi-dimensional compared to static images because spatial and temporal data are being processed, which demands complex deep learning (DL) models with increased computational costs. This article presents a novel triple-layer algorithm that efficiently reduces the 3D feature map into 1D row vectors and enhances the overall performance. First, we process the individual images in a given sequence using the MediaPipe framework and extract the regions of interest (ROI). The processed cropped image is then passed to the Inception-v3 for the 2D feature extractor. Finally, a long short-term memory (LSTM) network is used as a temporal feature extractor and classifier. Our proposed method achieves an average accuracy of more than 89.7%. The experimental results also show that the proposed framework outperforms existing state-of-the-art methods.
引用
收藏
页数:11
相关论文
共 34 条
  • [21] Dynamic Hand Gesture Recognition Based on Signals From Specialized Data Glove and Deep Learning Algorithms
    Dong, Yongfeng
    Liu, Jielong
    Yan, Wenjie
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [22] A Deep Bidirectional LSTM Model Enhanced by Transfer-Learning-Based Feature Extraction for Dynamic Human Activity Recognition
    Hassan, Najmul
    Miah, Abu Saleh Musa
    Shin, Jungpil
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [23] Deep Learning Model for Dynamic Hand Gesture Recognition for Natural Human- Machine Interface on End Devices
    Chang, Tsui-Ping
    Chen, Hung-Ming
    Chen, Shih-Ying
    Lin, Wei-Cheng
    INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2022, 13 (10)
  • [24] Investigations on Deep Learning Pre-trained Model Inception-V3 Using Transfer Learning for Remote Sensing Image Classification on Benchmark Datasets
    Gupta, Nisha
    Singh, Satvir
    Singh, Jagtar
    Mittal, Ajay
    Joshi, Garima
    FOURTH CONGRESS ON INTELLIGENT SYSTEMS, VOL 2, CIS 2023, 2024, 869 : 223 - 234
  • [25] LSTM-MSA: A Novel Deep Learning Model With Dual-Stage Attention Mechanisms Forearm EMG-Based Hand Gesture Recognition
    Zhang, Haotian
    Qu, Hang
    Teng, Long
    Tang, Chak-Yin
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 4749 - 4759
  • [26] Skeleton-Based Dynamic Hand Gesture Recognition Using an Enhanced Network with One-Shot Learning
    Ma, Chunyong
    Zhang, Shengsheng
    Wang, Anni
    Qi, Yongyang
    Chen, Ge
    APPLIED SCIENCES-BASEL, 2020, 10 (11):
  • [27] Automatic 3D Skeleton-based Dynamic Hand Gesture Recognition Using Multi-Layer Convolutional LSTM
    Mohammed, Adam A. Q.
    Gao, Yuan
    Ji, Zhilong
    Lv, Jiancheng
    Islam, Md Sajjatul
    Sang, Yongsheng
    2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 8 - 14
  • [28] A Novel Mems and Flex Sensor-Based Hand Gesture Recognition and Regenerating System Using Deep Learning Model
    Sumbul, Harun
    IEEE ACCESS, 2024, 12 : 133685 - 133693
  • [29] Deep learning framework based on integration of S-Mask R-CNN and Inception-v3 for ultrasound image-aided diagnosis of prostate cancer
    Liu, Zhiyong
    Yang, Chuan
    Huang, Jun
    Liu, Shaopeng
    Zhuo, Yumin
    Lu, Xu
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 114 : 358 - 367
  • [30] Deep Learning based Model for Detection of Vitiligo Skin Disease using Pre-trained Inception V3
    Sharma, Shagun
    Guleria, Kalpna
    Kumar, Sushil
    Tiwari, Sunita
    INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES, 2023, 8 (05) : 1024 - 1039