Efficient Sign Language Recognition System and Dataset Creation Method Based on Deep Learning and Image Processing

被引：2

作者：

Cavalcante Carneiro, A. L. ^{[1
]}

Silva, L. Brito ^{[1
]}

Pinheiro Salvadeo, D. H. ^{[1
]}

机构：

[1] State Univ Sao Paulo, Dept Stat Appl Math & Computat, Av 24A,1515, BR-13506700 Rio Claro, SP, Brazil

来源：

THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021) | 2021年 / 11878卷

关键词：

Deep learning; sign language recognition; convolutional neural networks; image classification; object detection;

D O I：

10.1117/12.2601018

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

New deep-learning architectures are created every year, achieving state-of-the-art results in image recognition and leading to the belief that, in a few years, complex tasks such as sign language translation will be considerably easier, serving as a communication tool for the hearing-impaired community. On the other hand, these algorithms still need a lot of data to be trained and the dataset creation process is expensive, time-consuming, and slow. Thereby, this work aims to investigate techniques of digital image processing and machine learning that can be used to create a sign language dataset effectively. We argue about data acquisition, such as the frames per second rate to capture or subsample the videos, the background type, preprocessing, and data augmentation, using convolutional neural networks and object detection to create an image classifier and comparing the results based on statistical tests. Different datasets were created to test the hypotheses, containing 14 words used daily and recorded by different smartphones in the RGB color system. We achieved an accuracy of 96.38% on the test set and 81.36% on the validation set containing more challenging conditions, showing that 30 FPS is the best frame rate subsample to train the classifier, geometric transformations work better than intensity transformations, and artificial background creation is not effective to model generalization. These trade-offs should be considered in future work as a cost-benefit guideline between computational cost and accuracy gain when creating a dataset and training a sign recognition model.

引用

页数：9

共 50 条

[21] Deep Learning-Based Approach for Sign Language Gesture Recognition With Efficient Hand Gesture Representation
Al-Hammadi, Muneer
Muhammad, Ghulam
Abdul, Wadood
Alsulaiman, Mansour
Bencherif, Mohammed A.
Alrayes, Tareq S.
Mathkour, Hassan
Mekhtiche, Mohamed Amine
[J]. IEEE ACCESS, 2020, 8 (08): : 192527 - 192542
[22] An Efficient Damage Relief System based on Image Processing and Deep Learning Techniques
Kanya, N.
Rani, Pacha Shobha
Geetha, S.
Rajkumar, M.
Sandhiya, G.
[J]. REVISTA GEINTEC-GESTAO INOVACAO E TECNOLOGIAS, 2021, 11 (02): : 2124 - 2131
[23] A Deep Learning based Approach for Recognition of Arabic Sign Language Letters
Hdioud, Boutaina
Tirari, Mohammed El Haj
[J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (04) : 424 - 429
[24] A review of deep learning-based approaches to sign language processing
Tan, Sihan
Khan, Nabeela
An, Zhaoyi
Ando, Yoshitaka
Kawakami, Rei
Nakadai, Kazuhiro
[J]. Advanced Robotics, 2024, 38 (23) : 1649 - 1667
[25] Continuous Sign Language Recognition System Using Deep Learning with MediaPipe Holistic
Srivastava, Sharvani
Singh, Sudhakar
Pooja, Shiv
Prakash, Shiv
[J]. WIRELESS PERSONAL COMMUNICATIONS, 2024, 137 (03) : 1455 - 1468
[26] Selecting Suitable Data Input for Deep-Learning Sign-Language Recognition with a Small Dataset
Chen, Yu-Jen
Su, Po-Chyi
[J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 384 - 391
[27] Ghanaian Sign Language Recognition Using Deep Learning
Odartey, Lamptey K.
Huang, Yonfeng
Asantewaa, Effah E.
Agbedanu, Promise R.
[J]. PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (PRAI 2019), 2019, : 81 - 86
[28] Recent Advances on Deep Learning for Sign Language Recognition
Zhang, Yanqiong
Jiang, Xianwei
[J]. CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 139 (03): : 2399 - 2450
[29] Recent Advances of Deep Learning for Sign Language Recognition
Zheng, Lihong
Liang, Bin
Jiang, Ailian
[J]. 2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 454 - 460
[30] Deep Learning Methods for Indian Sign Language Recognition
Likhar, Pratik
Bhagat, Neel Kamal
Rathna, G. N.
[J]. 2020 IEEE 10TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE-BERLIN), 2020,

← 1 2 3 4 5 →