Efficient Sign Language Recognition System and Dataset Creation Method Based on Deep Learning and Image Processing

被引:2
|
作者
Cavalcante Carneiro, A. L. [1 ]
Silva, L. Brito [1 ]
Pinheiro Salvadeo, D. H. [1 ]
机构
[1] State Univ Sao Paulo, Dept Stat Appl Math & Computat, Av 24A,1515, BR-13506700 Rio Claro, SP, Brazil
关键词
Deep learning; sign language recognition; convolutional neural networks; image classification; object detection;
D O I
10.1117/12.2601018
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
New deep-learning architectures are created every year, achieving state-of-the-art results in image recognition and leading to the belief that, in a few years, complex tasks such as sign language translation will be considerably easier, serving as a communication tool for the hearing-impaired community. On the other hand, these algorithms still need a lot of data to be trained and the dataset creation process is expensive, time-consuming, and slow. Thereby, this work aims to investigate techniques of digital image processing and machine learning that can be used to create a sign language dataset effectively. We argue about data acquisition, such as the frames per second rate to capture or subsample the videos, the background type, preprocessing, and data augmentation, using convolutional neural networks and object detection to create an image classifier and comparing the results based on statistical tests. Different datasets were created to test the hypotheses, containing 14 words used daily and recorded by different smartphones in the RGB color system. We achieved an accuracy of 96.38% on the test set and 81.36% on the validation set containing more challenging conditions, showing that 30 FPS is the best frame rate subsample to train the classifier, geometric transformations work better than intensity transformations, and artificial background creation is not effective to model generalization. These trade-offs should be considered in future work as a cost-benefit guideline between computational cost and accuracy gain when creating a dataset and training a sign recognition model.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Indian Sign Language Gesture Recognition using Image Processing and Deep Learning
    Bhagat, Neel Kamal
    Vishnusai, Y.
    Rathna, G. N.
    [J]. 2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 61 - 68
  • [2] A Deep Learning based Recognition System for Yemeni Sign Language
    Dabwan, Basel A.
    Jadhav, Mukti E.
    [J]. 2021 INTERNATIONAL CONFERENCE OF MODERN TRENDS IN INFORMATION AND COMMUNICATION TECHNOLOGY INDUSTRY (MTICTI 2021), 2021, : 20 - 24
  • [3] Dataset Transformation System for Sign Language Recognition Based on Image Classification Network
    Choi, Sang-Geun
    Park, Yeonji
    Sohn, Chae-Bong
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [4] Efficient deep learning models based on tension techniques for sign language recognition
    Attia N.F.
    Ahmed M.T.F.S.
    Alshewimy M.A.M.
    [J]. Intelligent Systems with Applications, 2023, 20
  • [5] Image-Based Arabic Sign Language Recognition System Using Transfer Deep Learning Models
    Bani Baker, Qanita
    Alqudah, Nour
    Alsmadi, Tibra
    Awawdeh, Rasha
    [J]. APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2023, 2023
  • [6] Pakistan sign language recognition: leveraging deep learning models with limited dataset
    Hafiz Muhammad Hamza
    Aamir Wali
    [J]. Machine Vision and Applications, 2023, 34
  • [7] Pakistan sign language recognition: leveraging deep learning models with limited dataset
    Hamza, Hafiz Muhammad
    Wali, Aamir
    [J]. MACHINE VISION AND APPLICATIONS, 2023, 34 (05)
  • [8] Review of Sign Language Recognition Based on Deep Learning
    Zhang Shujun
    Zhang Qun
    Li Hui
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (04) : 1021 - 1032
  • [9] Deep learning-based sign language recognition system for static signs
    Wadhawan, Ankita
    Kumar, Parteek
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12): : 7957 - 7968
  • [10] Automated Arabic Sign Language Recognition System Based on Deep Transfer Learning
    Shahid, A., I
    Almotairi, Sultan
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (10): : 144 - 152