Human-Computer Interaction with Hand Gesture Recognition Using ResNet and MobileNet

被引:6
|
作者
Alnuaim, Abeer [1 ]
Zakariah, Mohammed [2 ]
Hatamleh, Wesam Atef [2 ]
Tarazi, Hussam [3 ]
Tripathi, Vikas [4 ]
Amoatey, Enoch Tetteh [5 ]
机构
[1] King Saud Univ, Dept Comp Sci & Engn, Coll Appl Studies & Community Serv, POB 22459, Riyadh 11495, Saudi Arabia
[2] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Sci, POB 51178, Riyadh 11543, Saudi Arabia
[3] Oakland Univ, Sch Engn & Comp Sci, Dept Comp Sci & Informat, 318 Meadow Brook Rd, Rochester, MI 48309 USA
[4] Coll Graph Era Deemed Univ, Dept Comp Sci & Engn, Dehra Dun, Uttarakhand, India
[5] Univ Dev Studies, Sch Engn, Tamale, Ghana
关键词
703.2 Electric Filters - 716.1 Information Theory and Signal Processing - 723.5 Computer Applications;
D O I
10.1155/2022/8777355
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Sign language is the native language of deaf people, which they use in their daily life, and it facilitates the communication process between deaf people. The problem faced by deaf people is targeted using sign language technique. Sign language refers to the use of the arms and hands to communicate, particularly among those who are deaf. This varies depending on the person and the location from which they come. As a result, there is no standardization about the sign language to be used; for example, American, British, Chinese, and Arab sign languages are all distinct. Here, in this study we trained a model, which will be able to classify the Arabic sign language, which consists of 32 Arabic alphabet sign classes. In images, sign language is detected through the pose of the hand. In this study, we proposed a framework, which consists of two CNN models, and each of them is individually trained on the training set. The final predictions of the two models were ensembled to achieve higher results. The dataset used in this study is released in 2019 and is called as ArSL2018. It is launched at the Prince Mohammad Bin Fahd University, Al Khobar, Saudi Arabia. The main contribution in this study is resizing the images to 64 * 64 pixels, converting from grayscale images to three-channel images, and then applying the median filter to the images, which acts as lowpass filtering in order to smooth the images and reduce noise and to make the model more robust to avoid overfitting. Then, the preprocessed image is fed into two different models, which are ResNet50 and MobileNetV2. ResNet50 and MobileNetV2 architectures were implemented together. The results we achieved on the test set for the whole data are with an accuracy of about 97% after applying many preprocessing techniques and different hyperparameters for each model, and also different data augmentation techniques.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Recognition of hand gesture to human-computer interaction
    Lee, LK
    Kim, S
    Choi, YK
    Lee, MH
    [J]. IECON 2000: 26TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-4: 21ST CENTURY TECHNOLOGIES AND INDUSTRIAL OPPORTUNITIES, 2000, : 2117 - 2122
  • [2] A hand gesture recognition technique for human-computer interaction
    Kiliboz, Nurettin Cagri
    Gudukbay, Ugur
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 28 : 97 - 104
  • [3] Face and hand gesture recognition for human-computer interaction
    Hongo, H
    Ohya, M
    Yasumoto, M
    Yamamoto, K
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS: PATTERN RECOGNITION AND NEURAL NETWORKS, 2000, : 921 - 924
  • [4] Design of hand gesture recognition system for human-computer interaction
    Tsai, Tsung-Han
    Huang, Chih-Chi
    Zhang, Kung-Long
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (9-10) : 5989 - 6007
  • [5] A visual system for hand gesture recognition in human-computer interaction
    Okkonen, Matti-Antero
    Kellokumpu, Vili
    Pietikainen, Matti
    Heikkilae, Janne
    [J]. IMAGE ANALYSIS, PROCEEDINGS, 2007, 4522 : 709 - +
  • [6] THE METHOD FOR HUMAN-COMPUTER INTERACTION BASED ON HAND GESTURE RECOGNITION
    Raudonis, Vidas
    Jonaitis, Domas
    [J]. PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND CONTROL TECHNOLOGIES, 2013, : 45 - 49
  • [7] Design of hand gesture recognition system for human-computer interaction
    Tsung-Han Tsai
    Chih-Chi Huang
    Kung-Long Zhang
    [J]. Multimedia Tools and Applications, 2020, 79 : 5989 - 6007
  • [8] Continuous Body and Hand Gesture Recognition for Natural Human-Computer Interaction
    Song, Yale
    Demirdjian, David
    Davis, Randall
    [J]. ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2012, 2 (01) : 1 - 28
  • [9] Continuous Body and Hand Gesture Recognition for Natural Human-Computer Interaction
    Song, Yale
    Davis, Randall
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 4212 - 4216
  • [10] Human-computer interaction using gesture recognition and 3D hand tracking
    Segen, J
    Kumar, S
    [J]. 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 3, 1998, : 188 - 192