(2+1)D-SLR: an efficient network for video sign language recognition

被引:0
|
作者
Fei Wang
Yuxuan Du
Guorui Wang
Zhen Zeng
Lihong Zhao
机构
[1] Northeastern University,Faculty of Robot Science and Engineering
[2] Northeastern University,College of Information Science and Engineering
来源
Neural Computing and Applications | 2022年 / 34卷
关键词
Sign language recognition; Video understanding; Spatio-temporal features; Large-scale sign language video dataset;
D O I
暂无
中图分类号
学科分类号
摘要
The most existing sign language recognition methods have made significant progress. However, there are still problems in the field of sign language recognition: Traditional SLR technology relies on external devices such as data gloves, position tracker, and has achieved limited success. Moreover, the current state-of-the-art vision-based technologies cannot be applied in practice due to the difficulty in balancing accuracy and speed, because most of them pay the cost of running time for better sign language classification accuracy. In this paper, we propose a (2+1)D-SLR network based on (2+1)D convolution, which is different from other methods in that the proposed network can achieve higher accuracy with a faster speed. Because (2+1)D-SLR can learn spatio-temporal features from the raw sign RGB frames. In addition, the existing Chinese sign language dataset is difficult to guarantee the personality differences between different sign language speakers and the presentation differences of the same presenter. Therefore, we propose a large-scale Chinese sign language video dataset called NCSL to solve this problem, including 300 different sign language vocabulary which demonstrated by 30 volunteers, 10 times each. We also validated our method on NCSL and another large-scale sign language dataset, i.e., LSA64, Achieved 96.4% and 98.7% accuracy, respectively, demonstrating that our method can not only achieve competitive accuracy but be much faster than current well-known sign language recognition methods.
引用
收藏
页码:2413 / 2423
页数:10
相关论文
共 50 条
  • [1] (2+1)D-SLR: an efficient network for video sign language recognition
    Wang, Fei
    Du, Yuxuan
    Wang, Guorui
    Zeng, Zhen
    Zhao, Lihong
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (03): : 2413 - 2423
  • [2] TIM-SLR: a lightweight network for video isolated sign language recognition
    Fei Wang
    Libo Zhang
    Hao Yan
    Shuai Han
    Neural Computing and Applications, 2023, 35 : 22265 - 22280
  • [3] TIM-SLR: a lightweight network for video isolated sign language recognition
    Wang, Fei
    Zhang, Libo
    Yan, Hao
    Han, Shuai
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30): : 22265 - 22280
  • [4] Sign Language Recognition Based on Deep Learning with Improved (2+1)D-ResNet
    Sheng, Yueqin
    Liu, Qunpo
    Gao, Ruxin
    Hanajima, Naohiko
    JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2022, 9 (03): : 268 - 274
  • [5] A Review on Sign Language Recognition (SLR) System: ML and DL for SLR
    Das, Soumen
    Biswas, Saroj Kr
    Chakraborty, Manomita
    Purkayastha, Biswajit
    2021 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, SMART AND GREEN TECHNOLOGIES (ICISSGT 2021), 2021, : 177 - 182
  • [6] Sign Language Recognition Based on R(2+1)D With Spatial-Temporal-Channel Attention
    Han, Xiangzu
    Lu, Fei
    Yin, Jianqin
    Tian, Guohui
    Liu, Jun
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2022, 52 (04) : 687 - 698
  • [8] AdaBrowse: Adaptive Video Browser for Efficient Continuous Sign Language Recognition
    Hu, Lianyu
    Gao, Liqing
    Liu, Zekang
    Pun, Chi-Man
    Feng, Wei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 709 - 718
  • [9] SC2SLR: Skeleton-based Contrast for Sign Language Recognition
    Lyu, Silu
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024, 2024, : 404 - 410
  • [10] C2SLR: Consistency-enhanced Continuous Sign Language Recognition
    Zuo, Ronglai
    Mak, Brian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5121 - 5130