共 50 条
- [3] LEARNING CONTEXTUALLY FUSED AUDIO-VISUAL REPRESENTATIONS FOR AUDIO-VISUAL SPEECH RECOGNITION [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1346 - 1350
- [4] Deep Audio-visual Learning: A Survey [J]. Machine Intelligence Research, 2021, 18 (03) : 351 - 376
- [5] Deep Audio-visual Learning: A Survey [J]. International Journal of Automation and Computing, 2021, 18 : 351 - 376
- [6] The research on Digital audio-visual synesthesia [J]. 10TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2015), 2015, : 186 - 189
- [8] Multimodal Learning Using 3D Audio-Visual Data or Audio-Visual Speech Recognition [J]. 2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 40 - 43
- [9] ADVERSARIAL INPUT ABLATION FOR AUDIO-VISUAL LEARNING [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7742 - 7746
- [10] Learning Bimodal Structure in Audio-Visual Data [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (12): : 1898 - 1910