IMPROVING VOICE QUALITY OF HMM-BASED SPEECH SYNTHESIS USING VOICE CONVERSION METHOD

被引:0
|
作者
Jiao, Yishan [1 ]
Xie, Xiang [1 ]
Na, Xingyu [1 ]
Tu, Ming [1 ]
机构
[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
关键词
HMM-based speech synthesis; voice conversion; local linear transformation; temporal decomposition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
HMM-based speech synthesis system (HTS) often generates buzzy and muffled speech. Such degradation of voice quality makes synthetic speech sound robotically rather than naturally. From this point, we suppose that synthetic speech is in a different speaker space apart from the original. We propose to use voice conversion method to transform synthetic speech toward the original so as to improve its quality. Local linear transformation (LLT) combined with temporal decomposition (TD) is proposed as the conversion method. It can not only ensure smooth spectral conversion but also avoid over-smoothing problem. Moreover, we design a robust spectral selection and modification strategy to make the modified spectra stable. Preference test shows that the proposed method can improve the quality of HMM-based speech synthesis.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Robustness of HMM-based Speech Synthesis
    Yamagishi, Junichi
    Ling, Zhenhua
    King, Simon
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 581 - 584
  • [42] Czech HMM-Based Speech Synthesis
    Hanzlicek, Zdenek
    [J]. TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 291 - 298
  • [43] Arabic HMM-based Speech Synthesis
    Khalil, Krichi Mohamed
    Adnan, Cherif
    [J]. 2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 450 - 454
  • [44] A new HMM-based voice conversion methodology evaluated on monolingual and cross-lingual conversion tasks
    Percybrooks, Winston S.
    Moore, Elliot
    [J]. IEEE Transactions on Audio, Speech and Language Processing, 2015, 23 (12): : 2298 - 2310
  • [45] HMM-Based Vietnamese Speech Synthesis
    Trinh, Son
    Hoang, Kiem
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2015, 3 (04) : 33 - 47
  • [46] MULTI VOICE TEXT TO SPEECH SYNTHESIS BASED ON THE INSTANTANEOUS PARAMETRIC VOICE CONVERSION
    Azarov, Elias
    Petrovsky, Alexander
    Zubrycki, Piotr
    [J]. SPA 2010: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2010, : 78 - 82
  • [47] Voice conversion using HMM combined with GMM
    Yue Zhenjun
    Zou Xiang
    Jia Yongxing
    Wang Hao
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 366 - 370
  • [48] HMM-based singing voice synthesis system using pitch-shifted pseudo training data
    Mase, Ayami
    Oura, Keiichiro
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 845 - 848
  • [49] An improved method for voice pathology detection by means of a HMM-based feature space transformation
    Arias-Londono, Julian D.
    Godino-Llorente, Juan I.
    Saenz-Lechon, Nicolas
    Osma-Ruiz, Victor
    Castellanos-Dominguez, German
    [J]. PATTERN RECOGNITION, 2010, 43 (09) : 3100 - 3112
  • [50] Prediction method of speech recognition performance based on HMM-based speech synthesis technique
    Terashima, Ryuta
    Yoshimura, Takayoshi
    Wakita, Toshihiro
    Tokuda, Keiichi
    Kitamura, Tadashi
    [J]. IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (04) : 557 - 564