Next-Gen Dynamic Hand Gesture Recognition: MediaPipe, Inception-v3 and LSTM-Based Enhanced Deep Learning Model

被引:1
|
作者
Kwon, Oh-Jin [1 ]
Kim, Jaeho [2 ]
Jamil, Sonain [3 ]
Lee, Jinhee [1 ]
Ullah, Faiz [1 ]
机构
[1] Sejong Univ, Dept Elect Engn, Seoul 05006, South Korea
[2] Sejong Univ, Dept Elect Engn, Seoul 05006, South Korea
[3] Norwegian Univ Sci & Technol NTNU, Dept Comp Sci, N-2815 Gjovik, Norway
基金
新加坡国家研究基金会;
关键词
dynamic hand gesture recognition; hybrid deep learning; dimensionality reduction; temporal data classification; transfer learning; vision-based drone control;
D O I
10.3390/electronics13163233
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gesture recognition is crucial in computer vision-based applications, such as drone control, gaming, virtual and augmented reality (VR/AR), and security, especially in human-computer interaction (HCI)-based systems. There are two types of gesture recognition systems, i.e., static and dynamic. However, our focus in this paper is on dynamic gesture recognition. In dynamic hand gesture recognition systems, the sequences of frames, i.e., temporal data, pose significant processing challenges and reduce efficiency compared to static gestures. These data become multi-dimensional compared to static images because spatial and temporal data are being processed, which demands complex deep learning (DL) models with increased computational costs. This article presents a novel triple-layer algorithm that efficiently reduces the 3D feature map into 1D row vectors and enhances the overall performance. First, we process the individual images in a given sequence using the MediaPipe framework and extract the regions of interest (ROI). The processed cropped image is then passed to the Inception-v3 for the 2D feature extractor. Finally, a long short-term memory (LSTM) network is used as a temporal feature extractor and classifier. Our proposed method achieves an average accuracy of more than 89.7%. The experimental results also show that the proposed framework outperforms existing state-of-the-art methods.
引用
收藏
页数:11
相关论文
共 34 条
  • [1] LSTM-based dynamic probability continuous hand gesture trajectory recognition
    Jian, Chengfeng
    Li, Junjie
    Zhang, Meiyu
    IET IMAGE PROCESSING, 2019, 13 (12) : 2314 - 2320
  • [2] Research on rock hand specimen naming method based on deep learning and Inception-v3 model
    LI Si Jia
    SHEN Yan Jie
    QIAN Ye
    Global Geology, 2024, 27 (04) : 196 - 206
  • [3] Transfer learning based traffic sign recognition using inception-v3 model
    Lin C.
    Li L.
    Luo W.
    Wang K.C.P.
    Guo J.
    Periodica Polytechnica Transportation Engineering, 2019, 47 (03): : 242 - 250
  • [4] Stacked LSTM-Based Dynamic Hand Gesture Recognition with Six-Axis Motion Sensors
    Zhang, Yidi
    Ran, Mengyuan
    Liao, Jun
    Su, Guoxin
    Liu, Ming
    Liu, Li
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2568 - 2575
  • [5] Transfer learning based handwritten character recognition of tamil script using inception-V3 Model
    Gayathri, R.
    Lincy, R. Babitha
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (06) : 6091 - 6102
  • [6] Pulmonary Image Classification Based on Inception-v3 Transfer Learning Model
    Wang, Cheng
    Chen, Delei
    Hao, Lin
    Liu, Xuebo
    Zeng, Yu
    Chen, Jianwei
    Zhang, Guokai
    IEEE ACCESS, 2019, 7 : 146533 - 146541
  • [7] Image Recognition Model of Fraudulent Websites Based on Image Leader Decision and Inception-V3 Transfer Learning
    Shengli Zhou
    Cheng Xu
    Rui Xu
    Weijie Ding
    Chao Chen
    Xiaoyang Xu
    China Communications, 2024, 21 (01) : 215 - 227
  • [8] Image recognition model of fraudulent websites based on image leader decision and Inception-V3 transfer learning
    Zhou, Shengli
    Xu, Cheng
    Xu, Rui
    Ding, Weijie
    Chen, Chao
    Xu, Xiaoyang
    CHINA COMMUNICATIONS, 2024, 21 (01) : 215 - 227
  • [9] Office Garbage Intelligent Classification Based on Inception-v3 Transfer Learning Model
    Feng, Jie-wen
    Tang, Xiao-yu
    2020 4TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING AND ARTIFICIAL INTELLIGENCE (CCEAI 2020), 2020, 1487
  • [10] Predicting pedestrian-involved crash severity using inception-v3 deep learning model
    Khan, Md Nasim
    Das, Subasish
    Liu, Jinli
    ACCIDENT ANALYSIS AND PREVENTION, 2024, 197