Emotional speech synthesis based on improved codebook mapping voice conversion

被引:0
|
作者
Wang, YP [1 ]
Ling, ZH [1 ]
Wang, RH [1 ]
机构
[1] Univ Sci & Technol China, iFlytek Speech Lab, Hefei 230026, Peoples R China
来源
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS | 2005年 / 3784卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a spectral transformation method for emotional speech synthesis based on voice conversion framework. Three emotions are studied, including anger, happiness and sadness. For the sake of high naturalness, superior speech quality and emotion expressiveness, our original STASC system is modified by introducing a new feature selection strategy and hierarchical codebook mapping procedure. Our result shows that the LSF coefficients at low frequency carry more emotion-relative information, and therefore only these coefficients are converted. Listening tests prove that the proposed method can achieve a satisfactory balance between emotional expression and speech quality of converted speech signals.
引用
收藏
页码:374 / 381
页数:8
相关论文
共 50 条
  • [41] SPEECH AND VOICE SYNTHESIS
    THOMAS, MR
    BYTE, 1984, 9 (13): : 301 - 301
  • [42] On the transformation of the speech spectrum for voice conversion
    Baudoin, G
    Stylianou, Y
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1405 - 1408
  • [43] PHONETIC ANCHOR BASED STATE MAPPING FOR TEXTINDEPENDENT VOICE CONVERSION
    Zhang, Meng
    Tao, Jiaohua
    Nurminen, Jani
    Tian, Jilei
    Wang, Xia
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 723 - +
  • [44] Multi-level Prosody and Spectrum Conversion for Emotional Speech Synthesis
    Wang, Zexun
    Yu, Yibiao
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 588 - 593
  • [45] One-shot emotional voice conversion based on feature separation
    Lu, Wenhuan
    Zhao, Xinyue
    Guo, Na
    Li, Yongwei
    Wei, Jianguo
    Tao, Jianhua
    Dang, Jianwu
    SPEECH COMMUNICATION, 2022, 143 : 1 - 9
  • [46] Statistical parametric speech synthesis with a novel codebook-based excitation model
    Csapo, Tamas Gabor
    Nemeth, Geza
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2014, 8 (04): : 289 - 299
  • [47] Codebook based face point trajectory synthesis algorithm using speech input
    Arslan, LM
    Talkin, D
    SPEECH COMMUNICATION, 1999, 27 (02) : 81 - 93
  • [48] Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion
    Gia-Nhu Nguyen
    Trung-Nghia Phung
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2017,
  • [49] Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion
    Gia-Nhu Nguyen
    Trung-Nghia Phung
    EURASIP Journal on Audio, Speech, and Music Processing, 2017
  • [50] AN EVALUATION OF ALARYNGEAL SPEECH ENHANCEMENT METHODS BASED ON VOICE CONVERSION TECHNIQUES
    Doi, Hironori
    Nakamura, Keigo
    Toda, Tomoki
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5136 - 5139