共 46 条
- [1] Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1708 - 1712
- [3] ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis [J]. 2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 230 - 234
- [4] Deep Voice 2: Multi-Speaker Neural Text-to-Speech [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [5] Lightweight, Multi-Speaker, Multi-Lingual Indic Text-to-Speech [J]. IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 790 - 798
- [6] Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes [J]. INTERSPEECH 2020, 2020, : 2032 - 2036
- [7] LIGHTSPEECH: LIGHTWEIGHT NON-AUTOREGRESSIVE MULTI-SPEAKER TEXT-TO-SPEECH [J]. 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 499 - 506
- [8] Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [9] Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora [J]. INTERSPEECH 2019, 2019, : 1303 - 1307
- [10] INVESTIGATING ON INCORPORATING PRETRAINED AND LEARNABLE SPEAKER REPRESENTATIONS FOR MULTI-SPEAKER MULTI-STYLE TEXT-TO-SPEECH [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 8588 - 8592