共 50 条
- [41] SEEN AND UNSEEN EMOTIONAL STYLE TRANSFER FOR VOICE CONVERSION WITH A NEW EMOTIONAL SPEECH DATASET [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 920 - 924
- [43] Discrete/Continuous Modelling of Speaking Style in HMM-based Speech Synthesis: Design and Evaluation [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2796 - +
- [45] A COMPARISON OF SUPERVISED AND UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION APPROACHES FOR HMM-BASED SPEECH SYNTHESIS [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4598 - 4601
- [46] Building an English Speech Synthesis System from a Japanese ALS Patient's Voice [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1994 - +
- [47] Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis [J]. INTERSPEECH 2022, 2022, : 5523 - 5527
- [48] A study on time-dependent voice quality variation in a large-scale single speaker speech corpus used for speech synthesis [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 15 - 18
- [49] CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis [J]. INTERSPEECH 2022, 2022, : 5533 - 5537
- [50] Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 5 - 8