TIM-SLR: a lightweight network for video isolated sign language recognition

被引:0
|
作者
Fei Wang
Libo Zhang
Hao Yan
Shuai Han
机构
[1] Northeastern University,Faculty of Robot Science and Engineering
[2] Northeastern University,College of Information Science and Engineering
[3] Shengjing Hospital of China Medical University,Department of Neurosurgery
来源
关键词
Sign language recognition; Temporal interaction module; Lightweight network; Isolated sign language dataset;
D O I
暂无
中图分类号
学科分类号
摘要
The research on video isolated sign language recognition (SLR) algorithms has made leaping progress, but there are problems that need to be solved urgently in the field of SLR. On the one hand, traditional sign language acquisition equipment has the disadvantages of being expensive and not easy to carry. Sign language collected based on Kinect contains rich information, but it is complicated to use. The data acquired by RGB cameras are beneficial to practical applications, but the existing sign language datasets collected by RGB cameras have disadvantages such as few demonstrators and small vocabulary. On the other hand, most of the existing SLR methods use complex network structures to achieve high accuracy, but complex networks mean longer inference time, which cannot meet practical application scenarios at all. In this paper, we propose a Chinese large-scale isolated sign language dataset named CSLD, which is collected using RGB camera, and each vocabulary is illustrated 10 times by 30 demonstrators, including 400 words. In addition, we proposed a lightweight TIM-SLR network. In order to verify lightweight and validity of the network, we not only conducted experiments on sign language datasets CSLD and LSA64, and obtained 91.6% and 99.8% accuracy, respectively, but also performed experiments on action recognition datasets Sth-Sth (V1 and V2) and both achieve state-of-the-art performance. Not only can it obtain higher accuracy, but also inference speed and parameter of the network can meet practical application scenarios, because TIM-SLR network is only composed of 2D convolution and temporal interaction module (TIM).
引用
收藏
页码:22265 / 22280
页数:15
相关论文
共 50 条
  • [1] TIM-SLR: a lightweight network for video isolated sign language recognition
    Wang, Fei
    Zhang, Libo
    Yan, Hao
    Han, Shuai
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30): : 22265 - 22280
  • [2] (2+1)D-SLR: an efficient network for video sign language recognition
    Fei Wang
    Yuxuan Du
    Guorui Wang
    Zhen Zeng
    Lihong Zhao
    Neural Computing and Applications, 2022, 34 : 2413 - 2423
  • [3] (2+1)D-SLR: an efficient network for video sign language recognition
    Wang, Fei
    Du, Yuxuan
    Wang, Guorui
    Zeng, Zhen
    Zhao, Lihong
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (03): : 2413 - 2423
  • [4] A Review on Sign Language Recognition (SLR) System: ML and DL for SLR
    Das, Soumen
    Biswas, Saroj Kr
    Chakraborty, Manomita
    Purkayastha, Biswajit
    2021 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, SMART AND GREEN TECHNOLOGIES (ICISSGT 2021), 2021, : 177 - 182
  • [6] Survey of Hidden Markov Models (HMMs) for Sign Language Recognition (SLR)
    Sandjaja, Iwan
    Alsharoa, Ahmad
    Wunsch, Donald, II
    Liu, Jian
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS 2024, 2024,
  • [7] PseudoDepth-SLR: Generating Depth Data for Sign Language Recognition
    Sarhan, Noha
    Willruth, Jan M.
    Fritnrop, Simone
    COMPUTER VISION SYSTEMS, ICVS 2023, 2023, 14253 : 51 - 62
  • [8] Isolated sign language characters recognition
    Santosa, Paulus Insap
    Telkomnika, 2013, 11 (03): : 583 - 590
  • [9] Video-Based Sign Language Recognition via ResNet and LSTM Network
    Huang, Jiayu
    Chouvatut, Varin
    JOURNAL OF IMAGING, 2024, 10 (06)
  • [10] Sign Language Recognition (SLR): A Brisk Paired Deep Metric Attention Learning (BPDMAL) Model for Video Data Applications
    Kishore P.V.V.
    Anil Kumar D.
    Srinivasa Rao K.
    SN Computer Science, 5 (4)