共 50 条
- [21] Auxiliary Loss Multimodal GRU Model in Audio-Visual Speech Recognition IEEE ACCESS, 2018, 6 : 5573 - 5583
- [22] Speech enhancement and recognition in meetings with an audio-visual sensor array IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2257 - 2269
- [24] Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 341 - 345
- [25] An audio-visual speech recognition with a new mandarin audio-visual database INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
- [26] Integration of Deep Bottleneck Features for Audio-Visual Speech Recognition 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 563 - 567
- [27] MULTIPOSE AUDIO-VISUAL SPEECH RECOGNITION 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1065 - 1069
- [29] Audio-visual speech recognition by speechreading DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 1069 - 1072
- [30] Transfer Learning from Audio-Visual Grounding to Speech Recognition INTERSPEECH 2019, 2019, : 3242 - 3246