共 50 条
- [12] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 27315 - 27327
- [13] Tracking atoms with particles for audio-visual source localization 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 753 - +
- [14] Integrated audio-visual processing for object localization and tracking MULTIMEDIA COMPUTING AND NETWORKING 1998, 1997, 3310 : 206 - 213
- [15] Audio-Visual Multi-Speaker Tracking Based On the GLMB Framework INTERSPEECH 2020, 2020, : 3082 - 3086
- [18] ACCOUNTING FOR ROOM ACOUSTICS IN AUDIO-VISUAL MULTI-SPEAKER TRACKING 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6553 - 6557
- [19] Tracking the Active Speaker Based on a Joint Audio-Visual Observation Model 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 702 - 708
- [20] Audio-Visual Synchronisation for Speaker Diarisation 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2662 - +