共 50 条
- [3] NNSPEECH: SPEAKER-GUIDED CONDITIONAL VARIATIONAL AUTOENCODER FOR ZERO-SHOT MULTI-SPEAKER TEXT-TO-SPEECH [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4293 - 4297
- [4] SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model [J]. INTERSPEECH 2021, 2021, : 3645 - 3649
- [5] Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1708 - 1712
- [10] Normalization Driven Zero-shot Multi-Speaker Speech Synthesis [J]. INTERSPEECH 2021, 2021, : 1354 - 1358