共 50 条
- [42] Jointly Learning From Unimodal and Multimodal-Rated Labels in Audio-Visual Emotion Recognition IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2025, 6 : 165 - 174
- [43] Multimodal Learning Using 3D Audio-Visual Data or Audio-Visual Speech Recognition 2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 40 - 43
- [44] DEEP MULTIMODAL LEARNING FOR AUDIO-VISUAL SPEECH RECOGNITION 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2130 - 2134
- [46] Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 12607 - +
- [48] An audio-visual speech recognition with a new mandarin audio-visual database INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
- [50] Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 281 - 284