SPEECH PARAMETER GENERATION CONSIDERING LSP ORDERING PROPERTY FOR HMM-BASED SPEECH SYNTHESIS

Cited by: 0

Authors:

Qian, Shijun [1,2]

Wang, Huanliang [2]

Pei, Wenjiang [1]

Zou, Ping [2]

Wang, Kai [1]

Affiliations:

[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing, Jiangsu, Peoples R China

[2] AL Speech Co Ltd, Suzhou, Peoples R China

Source:

2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2012

Funding:

Specialized Research Fund for the Doctoral Program of Higher Education, Ministry of Education of China;

Keywords:

Speech synthesis; hidden Markov model; parameter generation; line spectral pair; ordering property;

DOI:

Not available

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

Line spectral pairs (LSPs) have many advantages for speech representation; in particular, they correlate well with spectral formants as long as the LSP parameters are strictly ordered and bounded. This ordering property cannot be guaranteed in HMM-based speech synthesis when LSPs are adopted as the spectral feature, because diagonal covariance matrices are used and the correlation between LSP dimensions is ignored, which causes instability in the synthesized speech. In this paper, we modify the parameter generation criterion to preserve the ordering property of the generated LSPs by considering not only the HMM and GV likelihoods maximized in the conventional method but also a mis-ordering penalty. Experimental results show that the proposed method significantly reduces mis-orderings and achieves high-quality synthesis when the penalty weight is selected appropriately.
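The idea of penalizing mis-ordered LSPs can be illustrated with a minimal sketch. The abstract does not give the penalty formula, so the squared-hinge form below, the function names `misordering_penalty`, `count_misorderings`, and `penalized_objective`, and the `margin` and `weight` parameters are all assumptions made for illustration, not the authors' definition. LSP frequencies are assumed to be in radians, bounded by 0 and π.

```python
import numpy as np


def misordering_penalty(lsp, margin=0.0):
    """Squared-hinge penalty on ordering/bound violations (illustrative assumption).

    lsp: array of shape (T, D), one row of LSP frequencies (radians) per frame.
    A valid frame satisfies 0 < lsp[0] < lsp[1] < ... < lsp[D-1] < pi.
    """
    lsp = np.asarray(lsp, dtype=float)
    n_frames = lsp.shape[0]
    # Pad each frame with the bounds 0 and pi so bound violations are counted too.
    padded = np.hstack([np.zeros((n_frames, 1)), lsp, np.full((n_frames, 1), np.pi)])
    gaps = np.diff(padded, axis=1)  # each gap should exceed `margin`
    violation = np.maximum(0.0, margin - gaps)
    return float(np.sum(violation ** 2))


def count_misorderings(lsp):
    """Count adjacent LSP pairs within each frame that are not strictly increasing."""
    lsp = np.asarray(lsp, dtype=float)
    return int(np.sum(np.diff(lsp, axis=1) <= 0))


def penalized_objective(log_likelihood, lsp, weight):
    """Hypothetical combined criterion: likelihood term minus weighted penalty."""
    return log_likelihood - weight * misordering_penalty(lsp)


if __name__ == "__main__":
    ordered = np.array([[0.3, 0.8, 1.5, 2.4]])
    swapped = np.array([[0.3, 1.5, 0.8, 2.4]])  # dimensions 2 and 3 are out of order
    print(count_misorderings(ordered), misordering_penalty(ordered))
    print(count_misorderings(swapped), misordering_penalty(swapped))
```

In a sketch like this, `weight` plays the role of the penalty weight mentioned in the abstract: a larger value pushes generated trajectories toward valid ordering at the expense of the likelihood term, which is consistent with the abstract's remark that the weight must be selected appropriately.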


Pages: 330-334

Page count: 5

50 items in total

[41] HMM-based Tibetan Lhasa Speech Synthesis System
Wu Zhiqiang
Yu Hongzhi
Li Guanyu
Wan Shuhui
[J]. 2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 92 - 95
[42] DIALOGUE CONTEXT SENSITIVE HMM-BASED SPEECH SYNTHESIS
Tsiakoulis, Pirros
Breslin, Catherine
Gasic, Milica
Henderson, Matthew
Kim, Dongho
Szummer, Martin
Thomson, Blaise
Young, Steve
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[43] Evaluation of the Slovenian HMM-based speech synthesis system
Vesnicer, B
Mihelic, F
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 513 - 520
[44] The Design and Implementation of HMM-based Dai Speech Synthesis
Wang, Zhan
Yang, Jian
Yang, Xin
[J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
[45] An HMM-based speech synthesis system applied to English
Tokuda, K
Zen, H
Black, AW
[J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 227 - 230
[46] REACTIVE AND CONTINUOUS CONTROL OF HMM-BASED SPEECH SYNTHESIS
Astrinaki, Maria
d'Alessandro, Nicolas
Picart, Benjamin
Drugman, Thomas
Dutoit, Thierry
[J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 252 - 257
[47] An improved minimum generation error based model adaptation for HMM-based speech synthesis
Wu, Yi-Jian
Qin, Long
Tokuda, Keiichi
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1727 - +
[48] Creation of HMM-based Speech Model for Estonian Text-to-Speech Synthesis
Nurk, Tonis
[J]. HUMAN LANGUAGE TECHNOLOGIES: THE BALTIC PERSPECTIVE, 2012, 247 : 162 - 168
[49] Prediction method of speech recognition performance based on HMM-based speech synthesis technique
Terashima, Ryuta
Yoshimura, Takayoshi
Wakita, Toshihiro
Tokuda, Keiichi
Kitamura, Tadashi
[J]. IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (04) : 557 - 564
[50] Synthesis and evaluation of conversational characteristics in HMM-based speech synthesis
Andersson, Sebastian
Yamagishi, Junichi
Clark, Robert A. J.
[J]. SPEECH COMMUNICATION, 2012, 54 (02) : 175 - 188
