共 50 条
- [2] On-Line Multi-Modal Speaker Diarization [J]. ICMI'07: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, 2007, : 350 - 357
- [3] LIMUSE: LIGHTWEIGHT MULTI-MODAL SPEAKER EXTRACTION [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 488 - 495
- [4] MULTI-MODAL FRONT-END FOR SPEAKER ACTIVITY DETECTION IN SMALL MEETINGS [J]. 2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011, : 536 - 541
- [5] MSDWILD: MULTI-MODAL SPEAKER DIARIZATION DATASET IN THE WILD [J]. INTERSPEECH 2022, 2022, : 1476 - 1480
- [6] Multi-modal Fusion Framework with Particle Filter for Speaker Tracking [J]. INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2012, 5 (04): : 65 - 76
- [7] Diarizing Large Corpora using Multi-modal Speaker Linking [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 602 - 606
- [8] MUSE: MULTI-MODAL TARGET SPEAKER EXTRACTION WITH VISUAL CUES [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6678 - 6682
- [10] Is Multi-Modal Necessarily Better? Robustness Evaluation of Multi-Modal Fake News Detection [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (06): : 3144 - 3158