共 50 条
- [42] IMPROVING NATURALNESS AND CONTROLLABILITY OF SEQUENCE-TO-SEQUENCE SPEECH SYNTHESIS BY LEARNING LOCAL PROSODY REPRESENTATIONS 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5724 - 5728
- [43] Syllable-level representations of suprasegmental features for DNN-based text-to-speech synthesis 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3186 - 3190
- [44] Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation INTERSPEECH 2020, 2020, : 3191 - 3195
- [45] LEARNING UTTERANCE-LEVEL REPRESENTATIONS FOR SPEECH EMOTION AND AGE/GENDER RECOGNITION USING DEEP NEURAL NETWORKS 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5150 - 5154
- [46] Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training INTERSPEECH 2022, 2022, : 5538 - 5542
- [47] Integrating Discrete Word-Level Style Variations into Non-Autoregressive Acoustic Models for Speech Synthesis INTERSPEECH 2022, 2022, : 5508 - 5512
- [50] Transfer learning based code-mixed part-of-speech tagging using character level representations for Indian languages Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 7207 - 7218