Audio-Visual Automatic Speech Recognition and Related Bimodal Speech Technologies: A Review of the State-of-the-Art and Open Problems

被引:5
|
作者
Potamianos, Gerasimos [1 ]
机构
[1] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, GR-15310 Athens, Greece
关键词
D O I
10.1109/ASRU.2009.5373530
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
引用
收藏
页码:22 / 22
页数:1
相关论文
共 50 条
  • [21] Speaker independent audio-visual speech recognition
    Zhang, Y
    Levinson, S
    Huang, T
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1073 - 1076
  • [22] An asynchronous DBN for audio-visual speech recognition
    Saenko, Kate
    Livescu, Karen
    2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 154 - +
  • [23] An audio-visual speech recognition system for testing new audio-visual databases
    Pao, Tsang-Long
    Liao, Wen-Yuan
    VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2006, : 192 - +
  • [24] LEARNING CONTEXTUALLY FUSED AUDIO-VISUAL REPRESENTATIONS FOR AUDIO-VISUAL SPEECH RECOGNITION
    Zhang, Zi-Qiang
    Zhang, Jie
    Zhang, Jian-Shu
    Wu, Ming-Hui
    Fang, Xin
    Dai, Li-Rong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1346 - 1350
  • [25] End-to-end audio-visual speech recognition for overlapping speech
    Rose, Richard
    Siohan, Olivier
    Tripathi, Anshuman
    Braga, Otavio
    INTERSPEECH 2021, 2021, : 3016 - 3020
  • [26] THE STATE-OF-THE-ART IN SPEECH RECOGNITION
    BISIANI, R
    TRENDS IN NEUROSCIENCES, 1985, 8 (01) : 9 - 11
  • [27] Audio-Visual Automatic Speech Recognition Using PZM, MFCC and Statistical Analysis
    Debnath, Saswati
    Roy, Pinki
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2021, 7 (02): : 121 - 133
  • [28] Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition
    Sterpu, George
    Saam, Christian
    Harte, Naomi
    ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 111 - 115
  • [29] RETRACTED ARTICLE: Audio-Visual Automatic Speech Recognition Towards Education for Disabilities
    Saswati Debnath
    Pinki Roy
    Suyel Namasudra
    Ruben Gonzalez Crespo
    Journal of Autism and Developmental Disorders, 2023, 53 : 3581 - 3594
  • [30] Retraction Note: Audio-Visual Automatic Speech Recognition Towards Education for Disabilities
    Saswati Debnath
    Pinki Roy
    Suyel Namasudra
    Ruben Gonzalez Crespo
    Journal of Autism and Developmental Disorders, 2024, 54 : 1627 - 1627