Video-Based Sign Language Recognition via ResNet and LSTM Network

被引:0
|
作者
Huang, Jiayu [1 ]
Chouvatut, Varin [1 ]
机构
[1] Chiang Mai Univ, Fac Sci, Dept Comp Sci, Chiang Mai 50200, Thailand
关键词
sign language recognition; deep learning; ResNet; LSTM;
D O I
10.3390/jimaging10060149
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Sign language recognition technology can help people with hearing impairments to communicate with non-hearing-impaired people. At present, with the rapid development of society, deep learning also provides certain technical support for sign language recognition work. In sign language recognition tasks, traditional convolutional neural networks used to extract spatio-temporal features from sign language videos suffer from insufficient feature extraction, resulting in low recognition rates. Nevertheless, a large number of video-based sign language datasets require a significant amount of computing resources for training while ensuring the generalization of the network, which poses a challenge for recognition. In this paper, we present a video-based sign language recognition method based on Residual Network (ResNet) and Long Short-Term Memory (LSTM). As the number of network layers increases, the ResNet network can effectively solve the granularity explosion problem and obtain better time series features. We use the ResNet convolutional network as the backbone model. LSTM utilizes the concept of gates to control unit states and update the output feature values of sequences. ResNet extracts the sign language features. Then, the learned feature space is used as the input of the LSTM network to obtain long sequence features. It can effectively extract the spatio-temporal features in sign language videos and improve the recognition rate of sign language actions. An extensive experimental evaluation demonstrates the effectiveness and superior performance of the proposed method, with an accuracy of 85.26%, F1-score of 84.98%, and precision of 87.77% on Argentine Sign Language (LSA64).
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Video-Based Chinese Sign Language Recognition Using Convolutional Neural Network
    Yang, Su
    Zhu, Qing
    [J]. 2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 929 - 934
  • [2] Benchmark Databases for Video-Based Automatic Sign Language Recognition
    Dreuw, Philippe
    Neidle, Carol
    Athitsos, Vassilis
    Sclaroff, Stan
    Ney, Hermann
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1115 - 1120
  • [3] Video-Based Sign Language Recognition without Temporal Segmentation
    Huang, Jie
    Zhou, Wengang
    Zhang, Qilin
    Li, Houqiang
    Li, Weiping
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2257 - 2264
  • [4] Isolated Video-Based Sign Language Recognition Using a Hybrid CNN-LSTM Framework Based on Attention Mechanism
    Kumari, Diksha
    Anand, Radhey Shyam
    [J]. ELECTRONICS, 2024, 13 (07)
  • [5] Video-based continuous sign language recognition using statistical methods
    Bauer, B
    Hienz, H
    Kraiss, KF
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS: PATTERN RECOGNITION AND NEURAL NETWORKS, 2000, : 463 - 466
  • [6] Video-Based Vietnamese Sign Language Recognition Using Local Descriptors
    Vo, Anh H.
    Nguyen, Nhu T. Q.
    Nguyen, Ngan T. B.
    Van-Huy Pham
    Van Giap, Ta
    Nguyen, Bao T.
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT II, 2019, 11432 : 680 - 693
  • [7] Chinese Sign Language Recognition with Batch Sampling ResNet-Bi-LSTM
    Chung W.-Y.
    Xu H.
    Lee B.G.
    [J]. SN Computer Science, 3 (5)
  • [8] Video-based feature extraction techniques for isolated Arabic Sign Language recognition
    Shanableh, T.
    Assaleh, K.
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 536 - +
  • [9] Video-based traffic sign detection and recognition
    Zhao, Qiuyu
    Shen, Yongliang
    Zhang, Yi
    [J]. 2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
  • [10] Research on Video-based Traffic Sign Recognition
    Sun, Yuge
    Li, Lei
    Ye, Ning
    Zhao, Lihong
    Lei, Hongwei
    Yang, Jie
    Sheng, Weihua
    [J]. 2017 IEEE 7TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2017, : 1500 - 1505