SPEECH PARAMETER GENERATION CONSIDERING LSP ORDERING PROPERTY FOR HMM-BASED SPEECH SYNTHESIS

被引:0
|
作者
Qian, Shijun [1 ,2 ]
Wang, Huanliang [2 ]
Pei, Wenjiang [1 ]
Zou, Ping [2 ]
Wang, Kai [1 ]
机构
[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing, Jiangsu, Peoples R China
[2] AL Speech Co Ltd, Suzhou, Peoples R China
基金
国家教育部博士点专项基金资助;
关键词
Speech synthesis; hidden Markov model; parameter generation; line spectral pair; ordering property;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
LSP has many advantages for speech representation, especially correlates well to spectrum formants as long as the LSP parameters are strictly ordered and bounded. This ordering property cannot be guaranteed during HMM-based speech synthesis when LSP is adopted as the spectrum feature, because diagonal covariance is utilized and correlation between LSP dimensions is ignored, with the result that unstable issue will be caused in synthesized speech. In this paper, we modify the parameter generation criterion to preserve ordering property of generated LSPs, by considering not only the likelihoods for HMM and GV maximized in conventional method but also a mis-orderings penalty. Experimental results show that the proposed method can alleviate the mis-orderings significantly and achieve high quality synthesizing performance when the penalty weight is selected appropriately.
引用
收藏
页码:330 / 334
页数:5
相关论文
共 50 条
  • [41] HMM-based Tibetan Lhasa Speech Synthesis System
    Wu Zhiqiang
    Yu Hongzhi
    Li Guanyu
    Wan Shuhui
    [J]. 2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 92 - 95
  • [42] DIALOGUE CONTEXT SENSITIVE HMM-BASED SPEECH SYNTHESIS
    Tsiakoulis, Pirros
    Breslin, Catherine
    Gasic, Milica
    Henderson, Matthew
    Kim, Dongho
    Szummer, Martin
    Thomson, Blaise
    Young, Steve
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [43] Evaluation of the Slovenian HMM-based speech synthesis system
    Vesnicer, B
    Mihelic, F
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 513 - 520
  • [44] The Design and Implementation of HMM-based Dai Speech Synthesis
    Wang, Zhan
    Yang, Jian
    Yang, Xin
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [45] An HMM-based speech synthesis system applied to English
    Tokuda, K
    Zen, H
    Black, AW
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 227 - 230
  • [46] REACTIVE AND CONTINUOUS CONTROL OF HMM-BASED SPEECH SYNTHESIS
    Astrinaki, Maria
    d'Alessandro, Nicolas
    Picart, Benjamin
    Drugman, Thomas
    Dutoit, Thierry
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 252 - 257
  • [47] An improved minimum generation error based model adaptation for HMM-based speech synthesis
    Wu, Yi-Jian
    Qin, Long
    Tokuda, Keiichi
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1727 - +
  • [48] Creation of HMM-based Speech Model for Estonian Text-to-Speech Synthesis
    Nurk, Tonis
    [J]. HUMAN LANGUAGE TECHNOLOGIES: THE BALTIC PERSPECTIVE, 2012, 247 : 162 - 168
  • [49] Prediction method of speech recognition performance based on HMM-based speech synthesis technique
    Terashima, Ryuta
    Yoshimura, Takayoshi
    Wakita, Toshihiro
    Tokuda, Keiichi
    Kitamura, Tadashi
    [J]. IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (04) : 557 - 564
  • [50] Synthesis and evaluation of conversational characteristics in HMM-based speech synthesis
    Andersson, Sebastian
    Yamagishi, Junichi
    Clark, Robert A. J.
    [J]. SPEECH COMMUNICATION, 2012, 54 (02) : 175 - 188