A Spatio-Temporal Framework for Dynamic Indian Sign Language Recognition

被引:2
|
作者
Sharma, Sakshi [1 ]
Singh, Sukhwinder [1 ]
机构
[1] Punjab Engn Coll, ECE Dept, Chandigarh, India
关键词
Convolutional neural network; Deep learning; Long short-term memory; Indian sign language; Sign language recognition system;
D O I
10.1007/s11277-023-10730-8
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
A sign language recognition system is a boon to the signer community as it eases the flow of information between the signer and non-signer communities. However, extracting timely detail from the video data is still a challenging task. In this paper, a deep learning based model consisting of trainable CNN and trainable stacked 2 bidirectional long short term memory (S2B-LSTM) has been proposed and tested to recognise the dynamic gestures of Indian sign language (ISL). The CNN architecture has been used as feature extractor to extract the spatial features from the input video data, whereas the temporal relation between the consecutive frames of input video is extracted using S2B-LSTM. This model has been trained and tested on self-developed dataset consisting of 360 videos of ISL dynamic gestures. The CNN-S2B-LSTM model outperforms the existing techniques of sign language recognition with best recognition accuracy of 97.6%.
引用
收藏
页码:2527 / 2541
页数:15
相关论文
共 50 条
  • [41] Continuous human action segmentation and recognition using a spatio-temporal probabilistic framework
    Chen, Duan-Yu
    Liao, Hong-Yuan Mark
    Shih, Sheng-Wen
    ISM 2006: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2006, : 275 - +
  • [42] Continuous Dynamic Indian Sign Language Gesture Recognition with Invariant Backgrounds
    Tripathi, Kumud
    Baranwal, Neha
    Nandi, G. C.
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 2211 - 2216
  • [43] A spatio-temporal, functional classification of Indian cities
    Pomeroy, G
    CHALLENGES TO ASIAN URBANIZATION IN THE 21ST CENTURY, 2003, 75 : 137 - 161
  • [44] Dynamic proximity of spatio-temporal sequences
    Horn, D
    Dror, G
    Quenet, B
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (05): : 1002 - 1008
  • [45] 3D sign language recognition using spatio temporal graph kernels
    Kumar, D. Anil
    Sastry, A. S. C. S.
    Kishore, P. V. V.
    Kumar, E. Kiran
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (02) : 143 - 152
  • [46] Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition
    Li, Tianjiao
    Foo, Lin Geng
    Ke, Qiuhong
    Rahmani, Hossein
    Wang, Anran
    Wang, Jinghua
    Liu, Jun
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 386 - 403
  • [47] A Comparison of Wavelet Based Spatio-temporal Decomposition Methods for Dynamic Texture Recognition
    Dubois, Sloven
    Peteri, Renaud
    Menard, Michel
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2009, 5524 : 314 - +
  • [48] Runtime Verification of Spatio-Temporal Specification Language
    Tengfei Li
    Jing Liu
    Haiying Sun
    Xiaohong Chen
    Ling Yin
    Xia Mao
    Junfeng Sun
    Mobile Networks and Applications, 2021, 26 : 2392 - 2406
  • [49] Spatio-temporal processing for distant speech recognition
    Low, SY
    Togneri, R
    Nordholm, S
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 1001 - 1004
  • [50] Spatio-Temporal Context Kernel for Activity Recognition
    Yuan, Fei
    Sahbi, Hichem
    Prinet, Veronique
    2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 436 - 440