MediSign: An Attention-Based CNN-BiLSTM Approach of Classifying Word Level Signs for Patient-Doctor Interaction in Hearing Impaired Community

被引:0
|
作者
Ihsan, Md. Amimul [1 ]
Eram, Abrar Faiaz [2 ]
Nahar, Lutfun [1 ]
Kadir, Muhammad Abdul [1 ]
机构
[1] Univ Dhaka, Dept Biomed Phys & Technol, Dhaka 1000, Bangladesh
[2] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1000, Bangladesh
关键词
Sign language; Assistive technologies; Hidden Markov models; Medical services; Error analysis; Patient monitoring; Deafness; Convolutional neural networks; Long short term memory; Bidirectional control; Attention; BiLSTM; MobileNetV2; patient-doctor interaction; sign language; RECOGNITION;
D O I
10.1109/ACCESS.2024.3370684
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Along with day-to-day communication, receiving medical care is quite challenging for the hearing impaired and mute population, especially in developing countries where medical facilities are not as modernized as in the West. A word-level sign language interpretation system that is aimed toward detecting medically relevant signs can allow smooth communication between doctors and hearing impaired patients, ensuring seamless medical care. To that end, a dataset from twenty distinct signers of diverse backgrounds performing 30 frequently used words in patient-doctor interaction was created. The proposed system has been built employing MobileNetV2 in conjunction with an attention-based Bidirectional LSTM network to achieve robust classification, where the validation accuracy and f1- scores were 95.83% and 93%, respectively. Notably, the accuracy of the proposed model surpasses the recent word-level sign language classification method in a medical context by 5%. Furthermore, the comparison of evaluation metrics with contemporary word-level sign language recognition models in American, Arabic, and German Sign Language further affirmed the capability of the proposed architecture.
引用
收藏
页码:33803 / 33815
页数:13
相关论文
共 1 条