Dynamic Gesture Recognition using a Transformer and Mediapipe

被引:0
|
作者
Althubiti, Asma H. [1 ]
Algethami, Haneen [1 ]
机构
[1] Taif Univ, Dept Comp Sci, Coll Comp & Informat Technol, Taif 21944, Saudi Arabia
关键词
Gesture recognition; self-attention; transformer encoder; skeleton; transfer learning;
D O I
10.14569/IJACSA.2024.01506143
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There is a rising interest in dynamic gesture recognition as a research area. This is the result of emerging global pandemics as well as the need to avoid touching different surfaces. Most of the previous research has focused on implementing deep learning algorithms for the RGB modality. However, despite its potential to enhance the algorithm's performance, gesture recognition has not widely utilised the concept of attention. Most research also used three-dimensional convolutional networks with long short-term memory networks for gesture recognition. However, these networks can be computationally expensive. As a result, this paper employs pre-trained models in conjunction with the skeleton modality to address the challenges posed by background noise. The goal is to present a comparative analysis of various gesture recognition models, divided based on video frames or skeletons. The performance of different models was evaluated using a dataset taken from Kaggle with a size of 2 GB. Each video contains 30 frames (or images) to recognise five gestures. The transformer model for skeleton-based gesture recognition achieves 0.99 accuracy and can be used to capture temporal dependencies in sequential data.
引用
收藏
页码:1424 / 1439
页数:16
相关论文
共 50 条
  • [21] DYNAMIC HAND GESTURE RECOGNITION SYSTEM USING NEURAL NETWORK
    Mahanta, Chitralekha
    Yadav, T. Srinivas
    Medhi, Hemanta
    PECCS 2011: PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON PERVASIVE AND EMBEDDED COMPUTING AND COMMUNICATION SYSTEMS, 2011, : 253 - 256
  • [22] A dynamic gesture recognition and prediction system using the convexity approach
    Barros, Pablo
    Maciel-Junior, Nestor T.
    Fernandes, Bruno J. T.
    Bezerra, Byron L. D.
    Fernandes, Sergio M. M.
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 155 : 139 - 149
  • [23] Medical gesture recognition using dynamic arc length warping
    Cifuentes, Jenny
    Minh Tu Pham
    Moreau, Richard
    Boulanger, Pierre
    Prieto, Flavio
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2019, 52 : 162 - 170
  • [24] Dynamic gesture recognition using wireless signals with less disturbance
    Chen, Jiahui
    Li, Fan
    Chen, Huijie
    Yang, Song
    Wang, Yu
    PERSONAL AND UBIQUITOUS COMPUTING, 2019, 23 (01) : 17 - 27
  • [25] Human gesture recognition using a simplified dynamic Bayesian network
    Myung-Cheol Roh
    Seong-Whan Lee
    Multimedia Systems, 2015, 21 : 557 - 568
  • [26] A CNN-Transformer Hybrid Recognition Approach for sEMG-Based Dynamic Gesture Prediction
    Liu, Yanhong
    Li, Xingyu
    Yang, Lei
    Bian, Guibin
    Yu, Hongnian
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [27] Human gesture recognition using a simplified dynamic Bayesian network
    Roh, Myung-Cheol
    Lee, Seong-Whan
    MULTIMEDIA SYSTEMS, 2015, 21 (06) : 557 - 568
  • [28] Dynamic gesture recognition using wireless signals with less disturbance
    Jiahui Chen
    Fan Li
    Huijie Chen
    Song Yang
    Yu Wang
    Personal and Ubiquitous Computing, 2019, 23 : 17 - 27
  • [29] Dynamic Hand Gesture Recognition Using Hidden Markov Models
    Yang, Zhong
    Li, Yi
    Chen, Weidong
    Zheng, Yang
    PROCEEDINGS OF 2012 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, VOLS I-VI, 2012, : 360 - 365
  • [30] Dynamic Gesture Recognition using 3D Trajectory
    Wang, Qianqian
    Xu, Yuan-Rong
    Bai, Xiao
    Xu, Dan
    Chen, Yen-Lun
    Wu, Xinyu
    2014 4TH IEEE INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2014, : 598 - 601