Deep Learning in Sign Language Recognition: A Hybrid Approach for the Recognition of Static and Dynamic Signs

被引:5
|
作者
Buttar, Ahmed Mateen [1 ]
Ahmad, Usama [1 ]
Gumaei, Abdu H. [2 ]
Assiri, Adel [3 ]
Akbar, Muhammad Azeem [4 ]
Alkhamees, Bader Fahad [5 ]
机构
[1] Univ Agr Faisalabad, Dept Comp Sci, Faisalabad 38000, Pakistan
[2] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Sci, Al Kharj 11942, Saudi Arabia
[3] King Khalid Univ, Coll Business, Management Informat Syst Dept, Abha 61421, Saudi Arabia
[4] LUT Univ, Software Engn Dept, Lahti 15210, Finland
[5] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11543, Saudi Arabia
关键词
You Only Look Once (YOLO); Long Short-Term Memory (LSTM); deep learning; confusion matrix; convolutional neural network (CNN); MediaPipe holistic;
D O I
10.3390/math11173729
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
A speech impairment limits a person's capacity for oral and auditory communication. A great improvement in communication between the deaf and the general public would be represented by a real-time sign language detector. This work proposes a deep learning-based algorithm that can identify words from a person's gestures and detect them. There have been many studies on this topic, but the development of static and dynamic sign language recognition models is still a challenging area of research. The difficulty is in obtaining an appropriate model that addresses the challenges of continuous signs that are independent of the signer. Different signers' speeds, durations, and many other factors make it challenging to create a model with high accuracy and continuity. For the accurate and effective recognition of signs, this study uses two different deep learning-based approaches. We create a real-time American Sign Language detector using the skeleton model, which reliably categorizes continuous signs in sign language in most cases using a deep learning approach. In the second deep learning approach, we create a sign language detector for static signs using YOLOv6. This application is very helpful for sign language users and learners to practice sign language in real time. After training both algorithms separately for static and continuous signs, we create a single algorithm using a hybrid approach. The proposed model, consisting of LSTM with MediaPipe holistic landmarks, achieves around 92% accuracy for different continuous signs, and the YOLOv6 model achieves 96% accuracy over different static signs. Throughout this study, we determine which approach is best for sequential movement detection and for the classification of different signs according to sign language and shows remarkable accuracy in real time.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Deep learning-based sign language recognition system for static signs
    Wadhawan, Ankita
    Kumar, Parteek
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12): : 7957 - 7968
  • [2] Deep learning-based sign language recognition system for static signs
    Ankita Wadhawan
    Parteek Kumar
    [J]. Neural Computing and Applications, 2020, 32 : 7957 - 7968
  • [3] Real Time Static and Dynamic Sign Language Recognition using Deep Learning
    Jayanthi, P.
    Bhama, Ponsy R. K. Sathia
    Swetha, K.
    Subash, S. A.
    [J]. JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2022, 81 (11): : 1186 - 1194
  • [4] Fast recognition and classification of static and dynamic signs for Persian sign language
    Moghaddam, Milad
    Nahvi, Manoochehr
    Pourmomtaz, Negin
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 63 (03) : 228 - 240
  • [5] Fast recognition and classification of static and dynamic signs for Persian sign language
    Moghaddam, Milad
    Nahvi, Manoochehr
    Pourmomtaz, Negin
    [J]. International Journal of Computer Applications in Technology, 2020, 63 (03): : 228 - 240
  • [6] Vision Based Deep Learning Approach for Dynamic Indian Sign Language Recognition in Healthcare
    Uchil, Aditya P.
    Jha, Smriti
    Sudha, B. G.
    [J]. COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 371 - 383
  • [7] ESMAANI: A Static and Dynamic Arabic Sign Language Recognition System Based on Machine and Deep Learning Models
    Hisham, Essam
    Saleh, Sherine Nagy
    [J]. 2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
  • [8] Learning Signs from Subtitles: A Weakly Supervised Approach to Sign Language Recognition
    Cooper, Helen
    Bowden, Richard
    [J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 2560 - 2566
  • [9] Sign Language Recognition Using Deep Learning
    Ray, Anushka
    Syed, Shahbaz
    Poornima, S.
    Pushpalatha, M.
    [J]. JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 421 - 428
  • [10] A Deep Learning based Approach for Recognition of Arabic Sign Language Letters
    Hdioud, Boutaina
    Tirari, Mohammed El Haj
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (04) : 424 - 429