50 entries in total
- [1] Attention-Based Audio-Visual Fusion for Video Summarization [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 328 - 340
- [2] Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition [J]. ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018 : 111 - 115
- [3] Attention-based cross-modal fusion for audio-visual voice activity detection in musical video streams [J]. INTERSPEECH 2021, 2021 : 321 - 325
- [4] Attention-based Visual Question Generation [J]. 2021 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2021 : 82 - 86
- [5] Residual Attention-based Fusion for Video Classification [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019 : 478 - 480
- [6] Attention-Based Multimodal Fusion for Video Description [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017 : 4203 - 4212
- [7] A framework for estimating geometric distortions in video copies based on visual-audio fingerprints [J]. Signal, Image and Video Processing, 2015, 9 : 201 - 210
- [8] Hierarchical Attention-Based Fusion for Image Caption With Multi-Grained Rewards [J]. IEEE ACCESS, 2020, 8 : 57943 - 57951