50 items in total
- [41] Acoustic and Visual Knowledge Distillation for Contrastive Audio-Visual Localization [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, : 15 - 23
- [42] Multimodal pattern matching for audio-visual query and retrieval [J]. STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2001, 2001, 4315 : 188 - 195
- [43] An Audio-Visual Attention System for Online Association Learning [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2127 - 2130
- [44] DEEP AUDIO-VISUAL SPEECH SEPARATION WITH ATTENTION MECHANISM [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7314 - 7318
- [45] AUDIO-VISUAL EVENT RECOGNITION THROUGH THE LENS OF ADVERSARY [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 616 - 620
- [49] Tracking atoms with particles for audio-visual source localization [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 753 - +
- [50] AUDIO-VISUAL SPEAKER LOCALIZATION VIA WEIGHTED CLUSTERING [J]. 2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014