50 entries in total
- [31] Weighting schemes for audio-visual fusion in speech recognition [J]. 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vols I-VI, Proceedings, 2001: 173-176
- [32] Dynamic Bayesian Networks for Audio-Visual Speech Recognition [J]. EURASIP Journal on Advances in Signal Processing, 2002
- [33] Connectionism based audio-visual speech recognition method [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2024, 54 (10): 2984-2993
- [35] On Dynamic Stream Weighting for Audio-Visual Speech Recognition [J]. IEEE Transactions on Audio, Speech, and Language Processing, 2012, 20 (04): 1145-1157
- [36] Towards practical deployment of audio-visual speech recognition [J]. 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol III, Proceedings: Image and Multidimensional Signal Processing Special Sessions, 2004: 777-780
- [37] Audio-visual speech recognition using deep learning [J]. Applied Intelligence, 2015, 42 (04): 722-737
- [38] An audio-visual corpus for multimodal automatic speech recognition [J]. Journal of Intelligent Information Systems, 2017, 49: 167-192
- [40] Audio-Visual Efficient Conformer for Robust Speech Recognition [J]. 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023: 2257-2266