Dynamic Gesture Recognition using a Transformer and Mediapipe

被引:0
|
作者
Althubiti, Asma H. [1 ]
Algethami, Haneen [1 ]
机构
[1] Taif Univ, Dept Comp Sci, Coll Comp & Informat Technol, Taif 21944, Saudi Arabia
关键词
Gesture recognition; self-attention; transformer encoder; skeleton; transfer learning;
D O I
10.14569/IJACSA.2024.01506143
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There is a rising interest in dynamic gesture recognition as a research area. This is the result of emerging global pandemics as well as the need to avoid touching different surfaces. Most of the previous research has focused on implementing deep learning algorithms for the RGB modality. However, despite its potential to enhance the algorithm's performance, gesture recognition has not widely utilised the concept of attention. Most research also used three-dimensional convolutional networks with long short-term memory networks for gesture recognition. However, these networks can be computationally expensive. As a result, this paper employs pre-trained models in conjunction with the skeleton modality to address the challenges posed by background noise. The goal is to present a comparative analysis of various gesture recognition models, divided based on video frames or skeletons. The performance of different models was evaluated using a dataset taken from Kaggle with a size of 2 GB. Each video contains 30 frames (or images) to recognise five gestures. The transformer model for skeleton-based gesture recognition achieves 0.99 accuracy and can be used to capture temporal dependencies in sequential data.
引用
收藏
页码:1424 / 1439
页数:16
相关论文
共 50 条
  • [1] Gesture Recognition Using MediaPipe for Online Realtime Gameplay
    Patel, Urvil
    Rupani, Sourabh
    Saini, Vipin
    Tan, Xing
    2022 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2022, : 223 - 229
  • [2] CAPSULE TRANSFORMER NETWORK FOR DYNAMIC HAND GESTURE RECOGNITION USING MULTIMODAL DATA
    Lebas, Alexandre
    Slama, Rim
    Wannous, Hazem
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2130 - 2134
  • [3] A Transformer-Based Network for Dynamic Hand Gesture Recognition
    D'Eusanio, Andrea
    Simoni, Alessandro
    Pini, Stefano
    Borghi, Guido
    Vezzani, Roberto
    Cucchiara, Rita
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 623 - 632
  • [4] Research on Gesture Recognition Based on Improved YOLOv5 and Mediapipe
    Ni, Guangxing
    Xu, Hua
    Wang, Chao
    Computer Engineering and Applications, 60 (07): : 108 - 118
  • [5] Dynamic Hand Gesture Recognition Using Kinect
    Kadethankar, Atharva Ajit
    Joshi, Apurv Dilip
    2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017,
  • [6] Dynamic Fingure Gesture Recognition using KINECT
    Varshini, Lavanya M. R.
    Vidhyapathi, C. M.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2016, : 212 - 216
  • [7] Air Writing Recognition Using Mediapipe and Opencv
    Kumar, R. Nitin
    Vaishnavi, Makkena
    Gayatri, K. R.
    Prashanthi, Venigalla
    Supriya, M.
    UBIQUITOUS INTELLIGENT SYSTEMS, 2022, 302 : 447 - 454
  • [8] A Convolutional-Transformer-Based Approach for Dynamic Gesture Recognition of Data Gloves
    Tang, Yingzhe
    Pan, Mingzhang
    Li, Hongqi
    Cao, Xinxin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 13
  • [9] Calibrating Hand Gesture Recognition for Stroke Rehabilitation Internet-of-Things (RIOT) Using MediaPipe in Smart Healthcare Systems
    Zainuddin, Ahmad Anwar
    Dhuzuki, Nurul Hanis Mohd
    Puzi, Asmarani Ahmad
    Johar, Mohd Naqiuddin
    Yazid, Maslina
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 568 - 583
  • [10] Dynamic hand gesture recognition using the skeleton of the hand
    Ionescu, B
    Coquin, D
    Lambert, P
    Buzuloiu, V
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (13) : 2101 - 2109