Lexical Interpretation of Visual Cues Using Deep Learning

被引:0
|
作者
Budarapu, Amrita [1 ]
Jain, Komal [1 ]
Sree, S. Bindu [1 ]
Varshitha, T. [1 ]
Niveditha, B. [1 ]
机构
[1] Narayanamma Inst Technol & Sci, Dept CSE AI&ML, Hyderabad, India
关键词
Lexical interpretation; Lip Reading; CNN; GRU; Visual cues;
D O I
10.1007/978-981-97-8031-0_89
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lexical interpretation of visual cues is an approach for understanding spoken phrases by visually observing the movements and shapes of a speaker's lips. A comprehensive review of the existing methods exposes the limitations of traditional lip reading techniques in capturing both spatial and temporal dimensions of lip movements. To address this gap, this project presents an approach to advance lip reading efficacy by synergizing Convolutional Neural Networks (CNN) and Gated Recurrent Units (GRU). The system has achieved an accuracy of 97% on GRID dataset. The implications of this research extend to improved communication accessibility for individuals with hearing impairments as well as broader applications in areas such as criminal investigations and security.
引用
收藏
页码:833 / 842
页数:10
相关论文
共 50 条
  • [31] The role of auditory and visual cues in the interpretation of Mandarin ironic speech
    Li, Shanpeng
    Chen, Aoju
    Chen, Ying
    Tang, Ping
    JOURNAL OF PRAGMATICS, 2022, 201 : 3 - 14
  • [32] Automatic Classification of Lexical Stress in English and Arabic Languages using Deep Learning
    Shahin, Mostafa
    Epps, Julien
    Ahmed, Beena
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 175 - 179
  • [33] Support vector learning for gender classification using audio and visual cues: A comparison
    Walawalkar, L
    Yeasin, M
    Narasimhamurthy, AM
    Sharma, R
    PATTERN RECOGNITION WITH SUPPORT VECTOR MACHINES, PROCEEDINGS, 2002, 2388 : 144 - 159
  • [34] Biased Competition in Visual Processing Hierarchies: A Learning Approach Using Multiple Cues
    Gepperth, Alexander R. T.
    Rebhan, Sven
    Hasler, Stephan
    Fritsch, Jannik
    COGNITIVE COMPUTATION, 2011, 3 (01) : 146 - 166
  • [35] Lexical and referential cues to sentence interpretation: an investigation of children's interpretations of ambiguous sentences
    Kidd, E
    Bavin, EL
    JOURNAL OF CHILD LANGUAGE, 2005, 32 (04) : 855 - 876
  • [36] The Effect of Nonverbal Cues on the Interpretation of Utterances by People with Visual Impairments
    Sak-Wernicka, Jolanta
    JOURNAL OF VISUAL IMPAIRMENT & BLINDNESS, 2014, 108 (02) : 133 - 143
  • [37] Biased Competition in Visual Processing Hierarchies: A Learning Approach Using Multiple Cues
    Alexander R. T. Gepperth
    Sven Rebhan
    Stephan Hasler
    Jannik Fritsch
    Cognitive Computation, 2011, 3 : 146 - 166
  • [38] Deep learning approaches to lexical simplification: A survey
    North, Kai
    Ranasinghe, Tharindu
    Shardlow, Matthew
    Zampieri, Marcos
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, : 111 - 134
  • [39] Learning Human Activity From Visual Data Using Deep Learning
    Alhersh, Taha
    Stuckenschmidt, Heiner
    Rehman, Atiq Ur
    Belhaouari, Samir Brahim
    IEEE ACCESS, 2021, 9 : 106245 - 106253
  • [40] Deep Learning for Human Visual Attention Recognition Using Transfer Learning
    Nam Vu Hoai
    Huong Nguyen Mai
    Cuong Pham
    2018 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2018, : 42 - 46