共 50 条
- [1] Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition [J]. ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 111 - 115
- [2] Attention-based Visual-Audio Fusion for Video Caption Generation [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2019), 2019, : 839 - 844
- [3] Attention-based cross-modal fusion for audio-visual voice activity detection in musical video streams [J]. INTERSPEECH 2021, 2021, : 321 - 325
- [4] VIDEO CODING BASED ON AUDIO-VISUAL ATTENTION [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 57 - 60
- [8] NOISE-TOLERANT AUDIO-VISUAL ONLINE PERSON VERIFICATION USING AN ATTENTION-BASED NEURAL NETWORK FUSION [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3995 - 3999
- [9] Multi-Attention Audio-Visual Fusion Network for Audio Spatialization [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 394 - 401
- [10] The effect of using video title in attention-based video summarization [J]. 2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,