共 50 条
- [32] Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3893 - 3901
- [33] Cross-Modal Label Contrastive Learning for Unsupervised Audio-Visual Event Localization THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 215 - 222
- [34] Span-based Audio-Visual Localization PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1252 - 1260
- [35] Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 4012 - 4021
- [36] Audio-Visual Salieny Network with Audio Attention Module PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
- [37] Acoustic and Visual Knowledge Distillation for Contrastive Audio-Visual Localization PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, : 15 - 23
- [38] AUDIO-VISUAL EVENT RECOGNITION THROUGH THE LENS OF ADVERSARY 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 616 - 620