共 50 条
- [41] End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning [J]. INTERSPEECH 2019, 2019, : 4425 - 4429
- [42] An Unsupervised Method to Select a Speaker Subset from Large Multi-Speaker Speech Synthesis Datasets [J]. INTERSPEECH 2020, 2020, : 1758 - 1762
- [43] MULTI-SPEAKER EMOTIONAL SPEECH SYNTHESIS WITH FINE-GRAINED PROSODY MODELING [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5729 - 5733
- [44] MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation [J]. INTERSPEECH 2021, 2021, : 1119 - 1123
- [45] Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes [J]. INTERSPEECH 2020, 2020, : 2032 - 2036
- [46] INVESTIGATING ON INCORPORATING PRETRAINED AND LEARNABLE SPEAKER REPRESENTATIONS FOR MULTI-SPEAKER MULTI-STYLE TEXT-TO-SPEECH [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 8588 - 8592
- [47] Multi-Lingual Multi-Speaker Text-to-Speech Synthesis for Voice Cloning with Online Speaker Enrollment [J]. INTERSPEECH 2020, 2020, : 2932 - 2936
- [48] J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis [J]. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022, 2022-September : 2358 - 2362
- [49] MULTI-SPEAKER EMOTIONAL ACOUSTIC MODELING FOR CNN-BASED SPEECH SYNTHESIS [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6950 - 6954
- [50] J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis [J]. INTERSPEECH 2022, 2022, : 2358 - 2362