TIM-SLR: a lightweight network for video isolated sign language recognition

被引:0
|
作者
Fei Wang
Libo Zhang
Hao Yan
Shuai Han
机构
[1] Northeastern University,Faculty of Robot Science and Engineering
[2] Northeastern University,College of Information Science and Engineering
[3] Shengjing Hospital of China Medical University,Department of Neurosurgery
来源
关键词
Sign language recognition; Temporal interaction module; Lightweight network; Isolated sign language dataset;
D O I
暂无
中图分类号
学科分类号
摘要
The research on video isolated sign language recognition (SLR) algorithms has made leaping progress, but there are problems that need to be solved urgently in the field of SLR. On the one hand, traditional sign language acquisition equipment has the disadvantages of being expensive and not easy to carry. Sign language collected based on Kinect contains rich information, but it is complicated to use. The data acquired by RGB cameras are beneficial to practical applications, but the existing sign language datasets collected by RGB cameras have disadvantages such as few demonstrators and small vocabulary. On the other hand, most of the existing SLR methods use complex network structures to achieve high accuracy, but complex networks mean longer inference time, which cannot meet practical application scenarios at all. In this paper, we propose a Chinese large-scale isolated sign language dataset named CSLD, which is collected using RGB camera, and each vocabulary is illustrated 10 times by 30 demonstrators, including 400 words. In addition, we proposed a lightweight TIM-SLR network. In order to verify lightweight and validity of the network, we not only conducted experiments on sign language datasets CSLD and LSA64, and obtained 91.6% and 99.8% accuracy, respectively, but also performed experiments on action recognition datasets Sth-Sth (V1 and V2) and both achieve state-of-the-art performance. Not only can it obtain higher accuracy, but also inference speed and parameter of the network can meet practical application scenarios, because TIM-SLR network is only composed of 2D convolution and temporal interaction module (TIM).
引用
收藏
页码:22265 / 22280
页数:15
相关论文
共 50 条
  • [21] LGF-SLR: Hand Local-Global Fusion Network for Skeleton-Based Sign Language Recognition
    Gao, Qing
    Zhang, Meiqi
    Ju, Zhaojie
    IEEE SENSORS JOURNAL, 2025, 25 (05) : 8586 - 8597
  • [22] Video-based isolated hand sign language recognition using a deep cascaded model
    Rastgoo, Razieh
    Kiani, Kourosh
    Escalera, Sergio
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (31-32) : 22965 - 22987
  • [23] Gesture recognition for sign language Video Stream Translation
    Bai Fei
    Jiang Xuemei
    Hu Jiwei
    Lou Ping
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1311 - 1315
  • [24] Moroccan Sign Language Video Recognition with Deep Learning
    Boukdir, Abdelbasset
    Benaddy, Mohamed
    El Meslouhi, Othmane
    Kardouchi, Mustapha
    Akhloufi, Moulay
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL 1, 2023, 447 : 415 - 422
  • [25] A Brazilian Sign Language Video Database for Automatic Recognition
    Gameiro, Priscila, V
    Passos, Wesley L.
    Araujo, Gabriel M.
    de Lima, Amaro A.
    Gois, Jonathan N.
    Corbo, Anna R.
    2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 61 - 66
  • [26] Bangladeshi Hand Sign Language Recognition from Video
    Santa, Umme
    Tazreen, Farzana
    Chowdhury, Shayhan Ameen
    2017 20TH INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2017,
  • [27] Isolated Word Sign Language Recognition Based on Improved SKResNet-TCN Network
    Xu, Xuebin
    Meng, Kan
    Chen, Chen
    Lu, Longbin
    JOURNAL OF SENSORS, 2023, 2023
  • [28] SC2SLR: Skeleton-based Contrast for Sign Language Recognition
    Lyu, Silu
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024, 2024, : 404 - 410
  • [29] C2SLR: Consistency-enhanced Continuous Sign Language Recognition
    Zuo, Ronglai
    Mak, Brian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5121 - 5130
  • [30] SLOWFAST NETWORK FOR CONTINUOUS SIGN LANGUAGE RECOGNITION
    Ahn, Junseok
    Jang, Youngjoon
    Chung, Joon Son
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3920 - 3924