50 entries in total
- [1] Detection and analysis of attention errors in sequence-to-sequence text-to-speech [J]. INTERSPEECH 2021, 2021, : 2746 - 2750
- [2] Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 67
- [3] Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages [J]. INTERSPEECH 2020, 2020, : 3161 - 3165
- [4] A Unified Sequence-to-Sequence Front-End Model for Mandarin Text-to-Speech Synthesis [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6689 - 6693
- [5] Investigating the robustness of sequence-to-sequence text-to-speech models to imperfectly-transcribed training data [J]. INTERSPEECH 2019, 2019, : 1546 - 1550
- [6] Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining [J]. INTERSPEECH 2020, 2020, : 4676 - 4680
- [7] Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training [J]. INTERSPEECH 2021, 2021, : 811 - 815
- [8] Real-time neural text-to-speech with sequence-to-sequence acoustic model and WaveGlow or single Gaussian WaveRNN vocoders [J]. INTERSPEECH 2019, 2019, : 1308 - 1312
- [9] Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 477 - 484
- [10] Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text [J]. INTERSPEECH 2019, 2019, : 3790 - 3794