Spatiotemporal features representation with dynamic mode decomposition for hand gesture recognition using deep neural networks

被引:3
|
作者
Sharma, Bhavana [1 ]
Panda, Jeebananda [1 ]
机构
[1] DTU, Dept Elect & Commun Engn, New Delhi 110042, India
关键词
Hand gesture recognition; Dynamic mode decomposition (DMD); Time dynamics; Spatiotemporal features; Deep neural network;
D O I
10.1007/s11760-024-03038-y
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Hand Gesture Recognition (HGR) with complexity and diversity of hand images in uncontrolled environment is a challenging task because of complex backgrounds, light illumination, strong occlusions, blur motion. This work provides a thorough examination of spatiotemporal feature extraction with deep learning model in order to overcome practical variations in lighting and fluctuations of physical hand's movement in both space and time. The hand skin color is first filtered through YCbCr color space and in order to train the hand images, MediaPipe is used to distinguish the specific gesture region. With respect to spatial variations, the spatiotemporal features extraction is done by Dynamic Mode Decomposition (DMD) technique, where hand key features are decoupled with time dynamics and modes in order to obtain time-frequency analysis. Thus, the received reconstructed signal has an enhanced visibility of skin-color pixels. The extensive experiment is demonstrated by deep neural network ResNet18 for better classification on three publicly available datasets, namely, Ego hand dataset, American Sign Language (ASL) dataset and Senz3D dataset. This work outplays existing state-of-arts methods remarkable regarding spatiotemporal features extraction with an accuracy of Ego hand dataset is 97.85% and ASL dataset is 98.49% at specific dynamic modes three, whereas Senz3D dataset achieves 98.51% classification accuracy at dynamic mode two. We have obtained a competitive outcome when comparing the State-Of-The-Art (SOTA) techniques available for HGR.
引用
收藏
页码:3745 / 3759
页数:15
相关论文
共 50 条
  • [1] Spatiotemporal features representation with dynamic mode decomposition for hand gesture recognition using deep neural networks
    Bhavana Sharma
    Jeebananda Panda
    Signal, Image and Video Processing, 2024, 18 : 3745 - 3759
  • [2] Deep Neural Networks vs Bag of Features for Hand Gesture Recognition
    Mirsu, Radu
    Simion, Georgiana
    Caleanu, Catlin Daniel
    Ursulescu, Oana
    Calimanu, Ioana Pop
    2019 42ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2019, : 369 - 372
  • [3] Hand Gesture Recognition Using Deep Convolutional Neural Networks
    Strezoski, Gjorgji
    Stojanovski, Dario
    Dimitrovski, Ivica
    Madjarov, Gjorgji
    ICT INNOVATIONS 2016: COGNITIVE FUNCTIONS AND NEXT GENERATION ICT SYSTEMS, 2018, 665 : 49 - 58
  • [4] Dynamic Hand Gesture Recognition Using Computer Vision and Neural Networks
    Munasinghe, M. I. N. P.
    2018 3RD INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [5] Hand Gesture Recognition using Neural Networks
    Murthy, G. R. S.
    Jadon, R. S.
    2010 IEEE 2ND INTERNATIONAL ADVANCE COMPUTING CONFERENCE, 2010, : 134 - 138
  • [6] Semantic Segmentation based Hand Gesture Recognition using Deep Neural Networks
    Dutta, H. Pallab Jyoti
    Sarma, Debajit
    Bhuyan, M. K.
    Laskar, R. H.
    2020 TWENTY SIXTH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC 2020), 2020,
  • [7] Deep Dynamic Neural Networks for Gesture Segmentation and Recognition
    Wu, Di
    Shao, Ling
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 552 - 571
  • [8] Hand Gesture Recognition using Convolutional Neural Networks
    Lan, Shengchang
    He, Zonglong
    Chen, Weichu
    Chen, Lijia
    2018 USNC-URSI RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2018, : 147 - 148
  • [9] Hand Gesture Recognition in Video Sequences Using Deep Convolutional and Recurrent Neural Networks
    Obaid, Falah
    Babadi, Amin
    Yoosofan, Ahmad
    APPLIED COMPUTER SYSTEMS, 2020, 25 (01) : 57 - 61
  • [10] Dynamic Hand Gesture Recognition Using Generalized Time Warping and Deep Belief Networks
    Torres-Valencia, Cristian A.
    Garcia, Hernan F.
    Holguin, German A.
    Alvarez, Mauricio A.
    Orozco, Alvaro
    ADVANCES IN VISUAL COMPUTING, PT II (ISVC 2015), 2015, 9475 : 682 - 691