共 50 条
- [31] Keyword-based speaker localization: Localizing a target speaker in a multi-speaker environment [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2703 - 2707
- [32] Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows [J]. INTERSPEECH 2021, 2021, : 3131 - 3135
- [33] Sparse DNN-based speaker segmentation using side information [J]. ELECTRONICS LETTERS, 2015, 51 (08) : 651 - 653
- [34] Improving Multi-Speaker Tacotron with Speaker Gating Mechanisms [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7498 - 7503
- [35] Zero-shot multi-speaker accent TTS with limited accent data [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1931 - 1936
- [36] A hybrid approach to speaker recognition in multi-speaker environment [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 272 - 275
- [37] CAN WE USE COMMON VOICE TO TRAIN A MULTI-SPEAKER TTS SYSTEM? [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 900 - 905
- [38] Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1630 - 1635
- [39] Automatic speaker clustering from multi-speaker utterances [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 817 - 820
- [40] SYNTHESIZING DYSARTHRIC SPEECH USING MULTI-SPEAKER TTS FOR DYSARTHRIC SPEECH RECOGNITION [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7382 - 7386