共 50 条
- [1] MSDWILD: MULTI-MODAL SPEAKER DIARIZATION DATASET IN THE WILD [J]. INTERSPEECH 2022, 2022, : 1476 - 1480
- [2] Developing On-Line Speaker Diarization System [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2739 - 2743
- [3] Multi-modal segmental models for on-line handwriting recognition [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS: PATTERN RECOGNITION AND NEURAL NETWORKS, 2000, : 247 - 250
- [4] A Multi-Modal Learning System for On-Line Surgical Action Segmentation [J]. 2020 INTERNATIONAL SYMPOSIUM ON MEDICAL ROBOTICS (ISMR), 2020, : 132 - 138
- [5] Never-ending learning system for on-line speaker diarization [J]. 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 699 - 704
- [6] MULTI-MODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING COMPRESSED-DOMAIN VIDEO FEATURES [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4069 - +
- [8] LIMUSE: LIGHTWEIGHT MULTI-MODAL SPEAKER EXTRACTION [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 488 - 495
- [9] Multi-modal biometrics authentication using on-line signature and voice pitch [J]. 2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2, 2006, : 363 - +
- [10] MAAS: Multi-modal Assignation for Active Speaker Detection [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 265 - 274