Emotional speech synthesis based on improved codebook mapping voice conversion

被引：0

作者：

Wang, YP ^{[1
]}

Ling, ZH ^{[1
]}

Wang, RH ^{[1
]}

机构：

[1] Univ Sci & Technol China, iFlytek Speech Lab, Hefei 230026, Peoples R China

来源：

AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS | 2005年 / 3784卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a spectral transformation method for emotional speech synthesis based on voice conversion framework. Three emotions are studied, including anger, happiness and sadness. For the sake of high naturalness, superior speech quality and emotion expressiveness, our original STASC system is modified by introducing a new feature selection strategy and hierarchical codebook mapping procedure. Our result shows that the LSF coefficients at low frequency carry more emotion-relative information, and therefore only these coefficients are converted. Listening tests prove that the proposed method can achieve a satisfactory balance between emotional expression and speech quality of converted speech signals.

引用

页码：374 / 381

页数：8

共 50 条

[41] SPEECH AND VOICE SYNTHESIS
THOMAS, MR
BYTE, 1984, 9 (13): : 301 - 301
[42] On the transformation of the speech spectrum for voice conversion
Baudoin, G
Stylianou, Y
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1405 - 1408
[43] PHONETIC ANCHOR BASED STATE MAPPING FOR TEXTINDEPENDENT VOICE CONVERSION
Zhang, Meng
Tao, Jiaohua
Nurminen, Jani
Tian, Jilei
Wang, Xia
ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 723 - +
[44] Multi-level Prosody and Spectrum Conversion for Emotional Speech Synthesis
Wang, Zexun
Yu, Yibiao
2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 588 - 593
[45] One-shot emotional voice conversion based on feature separation
Lu, Wenhuan
Zhao, Xinyue
Guo, Na
Li, Yongwei
Wei, Jianguo
Tao, Jianhua
Dang, Jianwu
SPEECH COMMUNICATION, 2022, 143 : 1 - 9
[46] Statistical parametric speech synthesis with a novel codebook-based excitation model
Csapo, Tamas Gabor
Nemeth, Geza
INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2014, 8 (04): : 289 - 299
[47] Codebook based face point trajectory synthesis algorithm using speech input
Arslan, LM
Talkin, D
SPEECH COMMUNICATION, 1999, 27 (02) : 81 - 93
[48] Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion
Gia-Nhu Nguyen
Trung-Nghia Phung
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2017,
[49] Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion
Gia-Nhu Nguyen
Trung-Nghia Phung
EURASIP Journal on Audio, Speech, and Music Processing, 2017
[50] AN EVALUATION OF ALARYNGEAL SPEECH ENHANCEMENT METHODS BASED ON VOICE CONVERSION TECHNIQUES
Doi, Hironori
Nakamura, Keigo
Toda, Tomoki
Saruwatari, Hiroshi
Shikano, Kiyohiro
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5136 - 5139

← 1 2 3 4 5 →