共 50 条
- [22] A JOINT AUDIO-VISUAL APPROACH TO AUDIO LOCALIZATION 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 454 - 458
- [23] An Audio-Visual Dataset and Deep Learning Frameworks for Crowded Scene Classification 19TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2022, 2022, : 23 - 28
- [25] A Student-Teacher Architecture for Dialog Domain Adaptation under the Meta-Learning Setting THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13692 - 13700
- [26] Learning joint statistical models for audio-visual fusion and segregation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 772 - 778
- [27] Scene recognition with audio-visual sensor fusion Multisensor, Multisource Information Fusion: Architectures, Algorithms and Applications 2005, 2005, 5813 : 201 - 210
- [29] Scene-Aware Ensemble Learning for Robust Crowd Counting PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 360 - 372
- [30] Anchor-aware Deep Metric Learning for Audio-visual Retrieval PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 211 - 219