Lexical Interpretation of Visual Cues Using Deep Learning

Cited by: 0
Authors
Budarapu, Amrita [1]
Jain, Komal [1]
Sree, S. Bindu [1]
Varshitha, T. [1]
Niveditha, B. [1]
Affiliations
[1] Narayanamma Inst Technol & Sci, Dept CSE AI&ML, Hyderabad, India
Keywords
Lexical interpretation; Lip reading; CNN; GRU; Visual cues
DOI
10.1007/978-981-97-8031-0_89
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Lexical interpretation of visual cues is the task of understanding spoken phrases by observing the movements and shapes of a speaker's lips. A review of existing methods shows that traditional lip reading techniques struggle to capture both the spatial and the temporal dimensions of lip movements. To address this gap, this project presents an approach that combines Convolutional Neural Networks (CNNs) with Gated Recurrent Units (GRUs) to improve lip reading performance. The system achieves an accuracy of 97% on the GRID dataset. The research has implications for communication accessibility for individuals with hearing impairments, as well as for broader applications in areas such as criminal investigations and security.
Pages: 833-842
Number of pages: 10