50 entries in total
- [1] Audio-Visual Clustering for 3D Speaker Localization [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, PROCEEDINGS, 2008, 5237 : 86 - 97
- [2] Deep Audio-Visual Beamforming for Speaker Localization [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1132 - 1136
- [3] Audio-visual speaker localization using graphical models [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 291 - +
- [4] Probabilistic speaker localization in noisy environments by audio-visual integration [J]. 2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 4704 - +
- [6] AV16.3: An audio-visual corpus for speaker localization and tracking [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2005, 3361 : 182 - 195
- [7] Audio-Visual Synchronisation for Speaker Diarisation [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2662 - +
- [8] Binaural Audio-Visual Localization [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2961 - 2968
- [9] Audio-Visual Speaker Verification via Joint Cross-Attention [J]. SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 18 - 31
- [10] Real-time speaker localization and speech separation by audio-visual integration [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2002, : 1043 - 1049