50 entries in total
- [41] Relevant feature selection for audio-visual speech recognition [J]. 2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007: 179 - +
- [42] DEEP MULTIMODAL LEARNING FOR AUDIO-VISUAL SPEECH RECOGNITION [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015: 2130 - 2134
- [43] Weighting schemes for audio-visual fusion in speech recognition [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURAL NETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001: 173 - 176
- [44] Multistage information fusion for audio-visual speech recognition [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1-3, 2004: 1651 - 1654
- [45] Dynamic Bayesian Networks for Audio-Visual Speech Recognition [J]. EURASIP Journal on Advances in Signal Processing, 2002
- [46] Audio-Visual Efficient Conformer for Robust Speech Recognition [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023: 2257 - 2266
- [47] On Dynamic Stream Weighting for Audio-Visual Speech Recognition [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): 1145 - 1157
- [48] Audio-visual speech recognition using deep learning [J]. APPLIED INTELLIGENCE, 2015, 42 (04): 722 - 737
- [50] Towards practical deployment of audio-visual speech recognition [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004: 777 - 780