Deep Learning of Mouth Shapes for Sign Language

被引:57
|
作者
Koller, Oscar [1 ]
Ney, Hermann [1 ]
Bowden, Richard [2 ]
机构
[1] Rhein Westfal TH Aachen, Human Language Technol & Pattern Recog, Aachen, Germany
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 5XH, Surrey, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/ICCVW.2015.69
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper deals with robust modelling of mouth shapes in the context of sign language recognition using deep convolutional neural networks. Sign language mouth shapes are difficult to annotate and thus hardly any publicly available annotations exist. As such, this work exploits related information sources as weak supervision. Humans mainly look at the face during sign language communication, where mouth shapes play an important role and constitute natural patterns with large variability. However, most scientific research on sign language recognition still disregards the face. Hardly any works explicitly focus on mouth shapes. This paper presents our advances in the field of sign language recognition. We contribute in following areas: We present a scheme to learn a convolutional neural network in a weakly supervised fashion without explicit frame labels. We propose a way to incorporate neural network classifier outputs into a HMM approach. Finally, we achieve a significant improvement in classification performance of mouth shapes over the current state of the art.
引用
收藏
页码:477 / 483
页数:7
相关论文
共 50 条
  • [41] How Language Shapes Word of Mouth's Impact
    Packard, Grant
    Berger, Jonah
    JOURNAL OF MARKETING RESEARCH, 2017, 54 (04) : 572 - 588
  • [42] Feasibility Study on Deep Learning Scheme for Sign Language Motion Recognition
    Sakamoto, Kazuki
    Ota, Eiji
    Ozawa, Tatsunori
    Nishimura, Hiromitsu
    Tanaka, Hiroshi
    COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS, 2019, 772 : 1106 - 1115
  • [43] A review of deep learning-based approaches to sign language processing
    Tan, Sihan
    Khan, Nabeela
    An, Zhaoyi
    Ando, Yoshitaka
    Kawakami, Rei
    Nakadai, Kazuhiro
    Advanced Robotics, 2024, 38 (23) : 1649 - 1667
  • [44] Towards Realizing Sign Language to Emotional Speech Conversion by Deep Learning
    Wang, Weizhe
    Yang, Hongwu
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [45] Recognition of Indian Sign Language (ISL) Using Deep Learning Model
    Sharma, Sakshi
    Singh, Sukhwinder
    WIRELESS PERSONAL COMMUNICATIONS, 2022, 123 (01) : 671 - 692
  • [46] An Applied Holistic Landmark with Deep Learning for Thai Sign Language Recognition
    Chaikaew, Anusorn
    2022 37TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2022), 2022, : 1046 - 1049
  • [47] Implementation of wearable glove for sign language expression based on deep learning
    Hyeon-Jun Kim
    Soo-Whang Baek
    Microsystem Technologies, 2023, 29 : 1147 - 1163
  • [48] Sign Language Semantic Translation System using Ontology and Deep Learning
    Elsayed, Eman K.
    Fathy, Doaa R.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (01) : 141 - 147
  • [49] Development of a Sign Language for Total Paralysis and Interpretation using Deep Learning
    Dhrubo, Zahan Zib Sarowar
    Hridoy, Md Ashiful Islam
    Jamal, Lafifa
    Sarker, Sujan
    Shidujaman, Mohammad
    2020 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND ROBOTICS (ICIPROB 2020, 2020,
  • [50] Sign Language Gesture Recognition using Doppler Radar and Deep Learning
    Kulhandjian, Hovannes
    Sharma, Prakshi
    Kulhandjian, Michel
    D'Amours, Claude
    2019 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2019,