Audio-Visual Automatic Speech Recognition and Related Bimodal Speech Technologies: A Review of the State-of-the-Art and Open Problems

被引：5

作者：

Potamianos, Gerasimos ^{[1
]}

机构：

[1] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, GR-15310 Athens, Greece

来源：

2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009) | 2009年

关键词：

D O I：

10.1109/ASRU.2009.5373530

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

引用

页码：22 / 22

页数：1

共 50 条

[21] Speaker independent audio-visual speech recognition
Zhang, Y
Levinson, S
Huang, T
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1073 - 1076
[22] An asynchronous DBN for audio-visual speech recognition
Saenko, Kate
Livescu, Karen
2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 154 - +
[23] An audio-visual speech recognition system for testing new audio-visual databases
Pao, Tsang-Long
Liao, Wen-Yuan
VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2006, : 192 - +
[24] LEARNING CONTEXTUALLY FUSED AUDIO-VISUAL REPRESENTATIONS FOR AUDIO-VISUAL SPEECH RECOGNITION
Zhang, Zi-Qiang
Zhang, Jie
Zhang, Jian-Shu
Wu, Ming-Hui
Fang, Xin
Dai, Li-Rong
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1346 - 1350
[25] End-to-end audio-visual speech recognition for overlapping speech
Rose, Richard
Siohan, Olivier
Tripathi, Anshuman
Braga, Otavio
INTERSPEECH 2021, 2021, : 3016 - 3020
[26] THE STATE-OF-THE-ART IN SPEECH RECOGNITION
BISIANI, R
TRENDS IN NEUROSCIENCES, 1985, 8 (01) : 9 - 11
[27] Audio-Visual Automatic Speech Recognition Using PZM, MFCC and Statistical Analysis
Debnath, Saswati
Roy, Pinki
INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2021, 7 (02): : 121 - 133
[28] Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition
Sterpu, George
Saam, Christian
Harte, Naomi
ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 111 - 115
[29] RETRACTED ARTICLE: Audio-Visual Automatic Speech Recognition Towards Education for Disabilities
Saswati Debnath
Pinki Roy
Suyel Namasudra
Ruben Gonzalez Crespo
Journal of Autism and Developmental Disorders, 2023, 53 : 3581 - 3594
[30] Retraction Note: Audio-Visual Automatic Speech Recognition Towards Education for Disabilities
Saswati Debnath
Pinki Roy
Suyel Namasudra
Ruben Gonzalez Crespo
Journal of Autism and Developmental Disorders, 2024, 54 : 1627 - 1627

← 1 2 3 4 5 →