50 entries in total
- [41] ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis [J]. 2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022: 230-234
- [42] Phoneme Duration Modeling Using Speech Rhythm-Based Speaker Embeddings for Multi-Speaker Speech Synthesis [J]. INTERSPEECH 2021, 2021: 3141-3145
- [43] ForumSum: A Multi-Speaker Conversation Summarization Dataset [J]. Findings of the Association for Computational Linguistics, EMNLP 2021, 2021: 4592-4599
- [44] Speaker Conditioning of Acoustic Models Using Affine Transformation for Multi-Speaker Speech Recognition [J]. 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2021: 283-288
- [45] Speaker Diarization in a Multi-Speaker Environment Using Particle Swarm Optimization and Mutual Information [J]. 2008 IEEE International Conference on Multimedia and Expo, Vols 1-4, 2008: 1533-1536
- [47] Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS [J]. INTERSPEECH 2022, 2022: 2968-2972
- [48] Synthesis of expressive speaking styles with limited training data in a multi-speaker, prosody-controllable sequence-to-sequence architecture [J]. INTERSPEECH 2021, 2021: 4693-4697
- [50] Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes [J]. INTERSPEECH 2020, 2020: 2032-2036