共 50 条
- [2] Deep Voice 2: Multi-Speaker Neural Text-to-Speech [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [3] Cross-lingual, Multi-speaker Text-To-Speech Synthesis Using Neural Speaker Embedding [J]. INTERSPEECH 2019, 2019, : 2105 - 2109
- [4] Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1630 - 1635
- [5] Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes [J]. INTERSPEECH 2020, 2020, : 2032 - 2036
- [6] PHONEME DEPENDENT SPEAKER EMBEDDING AND MODEL FACTORIZATION FOR MULTI-SPEAKER SPEECH SYNTHESIS AND ADAPTATION [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6930 - 6934
- [7] DNN based multi-speaker speech synthesis with temporal auxiliary speaker ID embedding [J]. 2019 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2019, : 61 - 64
- [8] Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora [J]. INTERSPEECH 2019, 2019, : 1303 - 1307
- [10] An Unsupervised Method to Select a Speaker Subset from Large Multi-Speaker Speech Synthesis Datasets [J]. INTERSPEECH 2020, 2020, : 1758 - 1762