共 50 条
- [21] Phoneme Segmentation using Deep Learning for Speech Synthesis PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018), 2018, : 59 - 61
- [22] Controllable neural text-to-speech synthesis using intuitive prosodic features INTERSPEECH 2020, 2020, : 4432 - 4436
- [23] Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations INTERSPEECH 2023, 2023, : 4818 - 4822
- [25] LEARNING ACCENT REPRESENTATION WITH MULTI-LEVEL VAE TOWARDS CONTROLLABLE SPEECH SYNTHESIS 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 928 - 935
- [26] Fluent Personalized Speech Synthesis with Prosodic Word-Level Spontaneous Speech generation 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 294 - 298
- [27] Autoregressive Co-Training for Learning Discrete Speech Representations INTERSPEECH 2022, 2022, : 5000 - 5004
- [28] UNSUPERVISED WORD-LEVEL PROSODY TAGGING FOR CONTROLLABLE SPEECH SYNTHESIS 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7597 - 7601
- [29] Learning Character-level Representations for Part-of-Speech Tagging INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 1818 - 1826
- [30] Learning Utterance-level Representations with Label Smoothing for Speech Emotion Recognition INTERSPEECH 2020, 2020, : 4079 - 4083