共 50 条
- [42] Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6420 - 6429
- [44] Hierarchical multimodal attention for end -to -end audio-visual scene -aware dialogue response generation COMPUTER SPEECH AND LANGUAGE, 2020, 63
- [47] Detection of documentary scene changes by audio-visual fusion IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2003, 2728 : 227 - 237
- [48] VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation 2022 Language Resources and Evaluation Conference, LREC 2022, 2022, : 6735 - 6743
- [50] LEARNING SELECTIVE ASSIGNMENT NETWORK FOR SCENE-AWARE VEHICLE DETECTION 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1366 - 1370