Discriminative training for concatenative speech synthesis

被引：6

作者：

Kim, NS ^{[1
]}

Park, SS

机构：

[1] Seoul Natl Univ, Sch Elect Engn, Seoul 151742, South Korea

[2] Seoul Natl Univ, INMC, Seoul 151742, South Korea

来源：

IEEE SIGNAL PROCESSING LETTERS | 2004年 / 11卷 / 01期

关键词：

discriminative training; speech synthesis; unit selection;

D O I：

10.1109/LSP.2003.819345

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this letter, we propose an approach to train the cost functions used for unit selection in concatenative speech synthesis. We first view the unit selection as' a classification problem, and we apply the discriminative training technique, which is found to be an efficient way to perform parameter estimation in speech recognition. Instead of defining an objective function that accounts for the subjective speech quality, We take the classification. error as' the objective function to be optimized. The classification error is approximated by a smooth function, and the relevant parameters are updated by means of the gradient descent technique.

引用

页码：40 / 43

页数：4

共 50 条

[1] LSM-based boundary training for concatenative speech synthesis
Bellegarda, Jerome R.
[J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 721 - 724
[2] SET OF CONCATENATIVE UNITS FOR SPEECH SYNTHESIS
OLIVE, J
LIBERMAN, M
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S130 - S130
[3] On the detection of discontinuities in concatenative speech synthesis
Pantazis, Yannis
Stylianou, Yannis
[J]. PROGRESS IN NONLINEAR SPEECH PROCESSING, 2007, 4391 : 89 - +
[4] Spectral modification for concatenative speech synthesis
Wouters, J
Macon, MW
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 941 - 944
[5] Concatenative Resynthesis with Improved Training Signals for Speech Enhancement
Syed, Ali Raza
Trinh Viet Anh
Mandel, Michael I.
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1195 - 1199
[6] Forward masking phenomenon in concatenative speech synthesis
Cernak, M
Rozinaj, G
[J]. PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 691 - 694
[7] Automatic Labeling Schemes for Concatenative Speech Synthesis
Kacur, Juraj
Cepko, Jozef
Palenik, Andrej
[J]. PROCEEDINGS ELMAR-2008, VOLS 1 AND 2, 2008, : 639 - 642
[8] A Concatenative Synthesis Based Speech Synthesiser for Hindi
Gupta, Kshitij
[J]. ADVANCES IN COMPUTER AND INFORMATIOM SCIENCES AND ENGINEERING, 2008, : 261 - 264
[9] Acoustic speech unit segmentation for concatenative synthesis
Torres, H. M.
Gurlekian, J. A.
[J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02): : 196 - 206
[10] Control of spectral dynamics in concatenative speech synthesis
Wouters, J
Macon, MW
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (01): : 30 - 38

← 1 2 3 4 5 →