共 50 条
- [1] Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech INTERSPEECH 2023, 2023, : 4299 - 4303
- [3] ZERO-SHOT MULTI-SPEAKER TEXT-TO-SPEECH WITH STATE-OF-THE-ART NEURAL SPEAKER EMBEDDINGS 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6184 - 6188
- [4] SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model INTERSPEECH 2021, 2021, : 3645 - 3649
- [6] NNSPEECH: SPEAKER-GUIDED CONDITIONAL VARIATIONAL AUTOENCODER FOR ZERO-SHOT MULTI-SPEAKER TEXT-TO-SPEECH 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4293 - 4297
- [7] Normalization Driven Zero-shot Multi-Speaker Speech Synthesis INTERSPEECH 2021, 2021, : 1354 - 1358
- [9] Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1708 - 1712
- [10] Zero-shot multi-speaker accent TTS with limited accent data 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1931 - 1936