50 items in total
- [1] LIMUSE: LIGHTWEIGHT MULTI-MODAL SPEAKER EXTRACTION [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022: 488-495
- [2] MUSE: MULTI-MODAL TARGET SPEAKER EXTRACTION WITH VISUAL CUES [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 6678-6682
- [3] A syntactic approach to automatic lip feature extraction for speaker identification [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998: 3693-3696
- [4] The use of temporal speech and lip information for multi-modal speaker identification via multi-stream HMM's [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000: 2389-2392
- [5] Automatic Group Cohesiveness Detection With Multi-modal Features [J]. ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019: 577-581
- [6] Lip features automatic extraction [J]. 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 3, 1998: 168-172
- [7] On-Line Multi-Modal Speaker Diarization [J]. ICMI'07: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, 2007: 350-357
- [8] Automatic Detection and Verification of Pipeline Construction Features with Multi-modal data [J]. 2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014: 3116-3122
- [9] Speaker identification using speech and lip features [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005: 2565-2570
- [10] MSDWILD: MULTI-MODAL SPEAKER DIARIZATION DATASET IN THE WILD [J]. INTERSPEECH 2022, 2022: 1476-1480