Automatic generation of speech synthesis units based on closed loop training

被引:0
|
作者
Kagoshima, T
Akamine, M
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a new method for automatically generating speech synthesis units. A small set of synthesis units is selected from a large speech database by the proposed Closed-Loop Training method (CLT). Because CLT is based on the evaluation and minimization of the distortion caused by the synthesis process such as prosodic modification ! the selected synthesis units are most suitable for synthesizers. In this paper, CLT is applied to a waveform concatenation based synthesizer, whose basic unit is CV/VC(diphone). It is shown that synthesis units can be efficiently generated by CLT from a labeled speech database with a small amount of computation. Moreover, the synthesized speech is clear and smooth even though the storage size of the waveform dictionary is small.
引用
收藏
页码:963 / 966
页数:4
相关论文
共 50 条
  • [1] Automatic generation of synthesis units for trainable text-to-speech systems
    Hon, H
    Acero, A
    Huang, X
    Liu, J
    Plumpe, M
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 293 - 296
  • [2] CLOSED-LOOP DIGITAL AUTOMATIC GENERATION CONTROLLER
    SCOTT, DN
    CRESAP, RL
    PRIEBE, RF
    TEBRINK, DE
    TAKEUCHI, KA
    [J]. IEEE TRANSACTIONS ON POWER APPARATUS AND SYSTEMS, 1973, PA92 (06): : 1813 - 1813
  • [3] Automatic generation of subword units for speech recognition systems
    Singh, R
    Raj, B
    Stern, RM
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (02): : 89 - 99
  • [4] Closed loop fault diagnosis based on a nonlinear process model and automatic fuzzy rule generation
    Füssel, D
    Ballé, P
    Isermann, R
    [J]. (SAFEPROCESS'97): FAULT DETECTION, SUPERVISION AND SAFETY FOR TECHNICAL PROCESSES 1997, VOLS 1-3, 1998, : 349 - 354
  • [5] Minimum generation error training for HMM-based speech synthesis
    Wu, Yi-Jian
    Wang, Ren-Hua
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 89 - 92
  • [6] AUTOMATIC SYNTHESIS UNIT GENERATION FOR ENGLISH SPEECH SYNTHESIS BASED ON MULTILAYERED CONTEXT ORIENTED CLUSTERING
    NAKAJIMA, S
    [J]. SPEECH COMMUNICATION, 1994, 14 (04) : 313 - 324
  • [7] Closed-loop fault diagnosis based on a nonlinear process model and automatic fuzzy rule generation
    Ballé, P
    Fuessel, D
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2000, 13 (06) : 695 - 704
  • [8] Automatic generation of synthesis units and prosodic information for Chinese concatenative synthesis
    Wu, CH
    Chen, JH
    [J]. SPEECH COMMUNICATION, 2001, 35 (3-4) : 219 - 237
  • [9] Automatic Prosody Generation for Serbo-Croatian Speech Synthesis Based on Regression Trees
    Secujski, Milan
    Pekar, Darko
    Jakovljevic, Niksa
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3164 - +
  • [10] Reconsidering Read and Spontaneous Speech: Causal Perspectives on the Generation of Training Data for Automatic Speech Recognition
    Gabler, Philipp
    Geiger, Bernhard C.
    Schuppler, Barbara
    Kern, Roman
    [J]. INFORMATION, 2023, 14 (02)