共 50 条
- [1] On-Line Multi-Modal Speaker Diarization [J]. ICMI'07: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, 2007, : 350 - 357
- [2] MULTI-MODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING COMPRESSED-DOMAIN VIDEO FEATURES [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4069 - +
- [3] LIMUSE: LIGHTWEIGHT MULTI-MODAL SPEAKER EXTRACTION [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 488 - 495
- [4] MAAS: Multi-modal Assignation for Active Speaker Detection [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 265 - 274
- [7] Multi-modal Queried Object Detection in the Wild [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [8] SynDrone - Multi-modal UAV Dataset for Urban Scenarios [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2202 - 2212
- [9] MMChat: Multi-Modal Chat Dataset on Social Media [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5778 - 5786
- [10] A multi-modal dataset for gait recognition under occlusion [J]. APPLIED INTELLIGENCE, 2023, 53 (02) : 1517 - 1534