IMPROVING VOICE QUALITY OF HMM-BASED SPEECH SYNTHESIS USING VOICE CONVERSION METHOD

被引：0

作者：

Jiao, Yishan ^{[1
]}

Xie, Xiang ^{[1
]}

Na, Xingyu ^{[1
]}

Tu, Ming ^{[1
]}

机构：

[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

HMM-based speech synthesis; voice conversion; local linear transformation; temporal decomposition;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

HMM-based speech synthesis system (HTS) often generates buzzy and muffled speech. Such degradation of voice quality makes synthetic speech sound robotically rather than naturally. From this point, we suppose that synthetic speech is in a different speaker space apart from the original. We propose to use voice conversion method to transform synthetic speech toward the original so as to improve its quality. Local linear transformation (LLT) combined with temporal decomposition (TD) is proposed as the conversion method. It can not only ensure smooth spectral conversion but also avoid over-smoothing problem. Moreover, we design a robust spectral selection and modification strategy to make the modified spectra stable. Preference test shows that the proposed method can improve the quality of HMM-based speech synthesis.

引用

页数：5

共 50 条

[41] Robustness of HMM-based Speech Synthesis
Yamagishi, Junichi
Ling, Zhenhua
King, Simon
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 581 - 584
[42] Czech HMM-Based Speech Synthesis
Hanzlicek, Zdenek
[J]. TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 291 - 298
[43] Arabic HMM-based Speech Synthesis
Khalil, Krichi Mohamed
Adnan, Cherif
[J]. 2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 450 - 454
[44] A new HMM-based voice conversion methodology evaluated on monolingual and cross-lingual conversion tasks
Percybrooks, Winston S.
Moore, Elliot
[J]. IEEE Transactions on Audio, Speech and Language Processing, 2015, 23 (12): : 2298 - 2310
[45] HMM-Based Vietnamese Speech Synthesis
Trinh, Son
Hoang, Kiem
[J]. INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2015, 3 (04) : 33 - 47
[46] MULTI VOICE TEXT TO SPEECH SYNTHESIS BASED ON THE INSTANTANEOUS PARAMETRIC VOICE CONVERSION
Azarov, Elias
Petrovsky, Alexander
Zubrycki, Piotr
[J]. SPA 2010: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2010, : 78 - 82
[47] Voice conversion using HMM combined with GMM
Yue Zhenjun
Zou Xiang
Jia Yongxing
Wang Hao
[J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 366 - 370
[48] HMM-based singing voice synthesis system using pitch-shifted pseudo training data
Mase, Ayami
Oura, Keiichiro
Nankaku, Yoshihiko
Tokuda, Keiichi
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 845 - 848
[49] An improved method for voice pathology detection by means of a HMM-based feature space transformation
Arias-Londono, Julian D.
Godino-Llorente, Juan I.
Saenz-Lechon, Nicolas
Osma-Ruiz, Victor
Castellanos-Dominguez, German
[J]. PATTERN RECOGNITION, 2010, 43 (09) : 3100 - 3112
[50] Prediction method of speech recognition performance based on HMM-based speech synthesis technique
Terashima, Ryuta
Yoshimura, Takayoshi
Wakita, Toshihiro
Tokuda, Keiichi
Kitamura, Tadashi
[J]. IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (04) : 557 - 564

← 1 2 3 4 5 →