Dynamic Gesture Recognition using a Transformer and Mediapipe

被引:0
|
作者
Althubiti, Asma H. [1 ]
Algethami, Haneen [1 ]
机构
[1] Taif Univ, Dept Comp Sci, Coll Comp & Informat Technol, Taif 21944, Saudi Arabia
关键词
Gesture recognition; self-attention; transformer encoder; skeleton; transfer learning;
D O I
10.14569/IJACSA.2024.01506143
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There is a rising interest in dynamic gesture recognition as a research area. This is the result of emerging global pandemics as well as the need to avoid touching different surfaces. Most of the previous research has focused on implementing deep learning algorithms for the RGB modality. However, despite its potential to enhance the algorithm's performance, gesture recognition has not widely utilised the concept of attention. Most research also used three-dimensional convolutional networks with long short-term memory networks for gesture recognition. However, these networks can be computationally expensive. As a result, this paper employs pre-trained models in conjunction with the skeleton modality to address the challenges posed by background noise. The goal is to present a comparative analysis of various gesture recognition models, divided based on video frames or skeletons. The performance of different models was evaluated using a dataset taken from Kaggle with a size of 2 GB. Each video contains 30 frames (or images) to recognise five gestures. The transformer model for skeleton-based gesture recognition achieves 0.99 accuracy and can be used to capture temporal dependencies in sequential data.
引用
收藏
页码:1424 / 1439
页数:16
相关论文
共 50 条
  • [31] Recognition of Dynamic Hand Gesture using Hidden Markov Model
    Lynn, Kok Yi
    Wong, Farrah
    2022 INTERNATIONAL CONFERENCE ON GREEN ENERGY, COMPUTING AND SUSTAINABLE TECHNOLOGY (GECOST), 2022, : 419 - 422
  • [32] TraHGR: Transformer for Hand Gesture Recognition via Electromyography
    Zabihi, Soheil
    Rahimian, Elahe
    Asif, Amir
    Mohammadi, Arash
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 4211 - 4224
  • [33] MGRFormer: A Multimodal Transformer Approach for Surgical Gesture Recognition
    Feghoul, Kevin
    Maia, Deise Santana
    El Amrani, Mehdi
    Daoudi, Mohamed
    Amad, Ali
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [34] Multimodal Gesture Recognition with Spatio-Temporal Features Fusion Based on YOLOv5 and MediaPipe
    Cao, Wenyi
    Lu, Peiqi
    Cao, Wenxin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (08)
  • [35] Dynamic ROI Extraction for Palmprints using MediaPipe Hands
    Kocakulak, Mustafa
    Acir, Nurettin
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [36] Gesture Interaction System Design for Telerehabilitation Based on Mediapipe
    Zhai, Zhen
    2022 2ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE (SEAI 2022), 2022, : 279 - 283
  • [37] WAVEGLOVE: TRANSFORMER-BASED HAND GESTURE RECOGNITION USING MULTIPLE INERTIAL SENSORS
    Kralik, Matej
    Suppa, Marek
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1576 - 1580
  • [38] Dynamic Hand Gesture Recognition Framework
    Premaratne, Prashan
    Yang, Shuai
    Zhou, ZhengMao
    Bandara, Nalin
    INTELLIGENT COMPUTING METHODOLOGIES, 2014, 8589 : 834 - 845
  • [39] A Dynamic Gesture and Posture Recognition System
    Sgouropoulos, Kyriakos
    Stergiopoulou, Ekaterini
    Papamarkos, Nikos
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2014, 76 (02) : 283 - 296
  • [40] Dynamic Gesture Recognition for Social Robots
    Carlos Castillo, Jose
    Caceres-Dominguez, David
    Alonso-Martin, Fernando
    Castro-Gonzalez, Alvaro
    Angel Salichs, Miguel
    SOCIAL ROBOTICS, ICSR 2017, 2017, 10652 : 495 - 505