共 50 条
- [1] LIMUSE: LIGHTWEIGHT MULTI-MODAL SPEAKER EXTRACTION [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 488 - 495
- [2] Multi-Modal Anomaly Detection by Using Audio and Visual Cues [J]. IEEE ACCESS, 2021, 9 : 30587 - 30603
- [3] Automatic extraction of geometric lip features with application to multi-modal speaker identification [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 161 - +
- [4] On-Line Multi-Modal Speaker Diarization [J]. ICMI'07: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, 2007, : 350 - 357
- [6] Audio-visual Speaker Recognition via Multi-modal Correlated Neural Networks [J]. 2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE WORKSHOPS (WIW 2016), 2016, : 123 - 128
- [7] MSDWILD: MULTI-MODAL SPEAKER DIARIZATION DATASET IN THE WILD [J]. INTERSPEECH 2022, 2022, : 1476 - 1480
- [8] MAAS: Multi-modal Assignation for Active Speaker Detection [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 265 - 274
- [9] Visual Prompt Multi-Modal Tracking [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9516 - 9526
- [10] VISUAL AS MULTI-MODAL ARGUMENTATION IN LAW [J]. BRATISLAVA LAW REVIEW, 2021, 5 (01): : 91 - 110