Discriminative training for concatenative speech synthesis

被引:6
|
作者
Kim, NS [1 ]
Park, SS
机构
[1] Seoul Natl Univ, Sch Elect Engn, Seoul 151742, South Korea
[2] Seoul Natl Univ, INMC, Seoul 151742, South Korea
关键词
discriminative training; speech synthesis; unit selection;
D O I
10.1109/LSP.2003.819345
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this letter, we propose an approach to train the cost functions used for unit selection in concatenative speech synthesis. We first view the unit selection as' a classification problem, and we apply the discriminative training technique, which is found to be an efficient way to perform parameter estimation in speech recognition. Instead of defining an objective function that accounts for the subjective speech quality, We take the classification. error as' the objective function to be optimized. The classification error is approximated by a smooth function, and the relevant parameters are updated by means of the gradient descent technique.
引用
收藏
页码:40 / 43
页数:4
相关论文
共 50 条
  • [1] LSM-based boundary training for concatenative speech synthesis
    Bellegarda, Jerome R.
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 721 - 724
  • [2] SET OF CONCATENATIVE UNITS FOR SPEECH SYNTHESIS
    OLIVE, J
    LIBERMAN, M
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S130 - S130
  • [3] On the detection of discontinuities in concatenative speech synthesis
    Pantazis, Yannis
    Stylianou, Yannis
    [J]. PROGRESS IN NONLINEAR SPEECH PROCESSING, 2007, 4391 : 89 - +
  • [4] Spectral modification for concatenative speech synthesis
    Wouters, J
    Macon, MW
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 941 - 944
  • [5] Concatenative Resynthesis with Improved Training Signals for Speech Enhancement
    Syed, Ali Raza
    Trinh Viet Anh
    Mandel, Michael I.
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1195 - 1199
  • [6] Forward masking phenomenon in concatenative speech synthesis
    Cernak, M
    Rozinaj, G
    [J]. PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 691 - 694
  • [7] Automatic Labeling Schemes for Concatenative Speech Synthesis
    Kacur, Juraj
    Cepko, Jozef
    Palenik, Andrej
    [J]. PROCEEDINGS ELMAR-2008, VOLS 1 AND 2, 2008, : 639 - 642
  • [8] A Concatenative Synthesis Based Speech Synthesiser for Hindi
    Gupta, Kshitij
    [J]. ADVANCES IN COMPUTER AND INFORMATIOM SCIENCES AND ENGINEERING, 2008, : 261 - 264
  • [9] Acoustic speech unit segmentation for concatenative synthesis
    Torres, H. M.
    Gurlekian, J. A.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02): : 196 - 206
  • [10] Control of spectral dynamics in concatenative speech synthesis
    Wouters, J
    Macon, MW
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (01): : 30 - 38