Automatic generation of speech synthesis units based on closed loop training

被引：0

作者：

Kagoshima, T

Akamine, M

机构：

来源：

1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS | 1997年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper proposes a new method for automatically generating speech synthesis units. A small set of synthesis units is selected from a large speech database by the proposed Closed-Loop Training method (CLT). Because CLT is based on the evaluation and minimization of the distortion caused by the synthesis process such as prosodic modification ! the selected synthesis units are most suitable for synthesizers. In this paper, CLT is applied to a waveform concatenation based synthesizer, whose basic unit is CV/VC(diphone). It is shown that synthesis units can be efficiently generated by CLT from a labeled speech database with a small amount of computation. Moreover, the synthesized speech is clear and smooth even though the storage size of the waveform dictionary is small.

引用

页码：963 / 966

页数：4

共 50 条

[1] Automatic generation of synthesis units for trainable text-to-speech systems
Hon, H
Acero, A
Huang, X
Liu, J
Plumpe, M
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 293 - 296
[2] CLOSED-LOOP DIGITAL AUTOMATIC GENERATION CONTROLLER
SCOTT, DN
CRESAP, RL
PRIEBE, RF
TEBRINK, DE
TAKEUCHI, KA
[J]. IEEE TRANSACTIONS ON POWER APPARATUS AND SYSTEMS, 1973, PA92 (06): : 1813 - 1813
[3] Automatic generation of subword units for speech recognition systems
Singh, R
Raj, B
Stern, RM
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (02): : 89 - 99
[4] Closed loop fault diagnosis based on a nonlinear process model and automatic fuzzy rule generation
Füssel, D
Ballé, P
Isermann, R
[J]. (SAFEPROCESS'97): FAULT DETECTION, SUPERVISION AND SAFETY FOR TECHNICAL PROCESSES 1997, VOLS 1-3, 1998, : 349 - 354
[5] Minimum generation error training for HMM-based speech synthesis
Wu, Yi-Jian
Wang, Ren-Hua
[J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 89 - 92
[6] AUTOMATIC SYNTHESIS UNIT GENERATION FOR ENGLISH SPEECH SYNTHESIS BASED ON MULTILAYERED CONTEXT ORIENTED CLUSTERING
NAKAJIMA, S
[J]. SPEECH COMMUNICATION, 1994, 14 (04) : 313 - 324
[7] Closed-loop fault diagnosis based on a nonlinear process model and automatic fuzzy rule generation
Ballé, P
Fuessel, D
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2000, 13 (06) : 695 - 704
[8] Automatic generation of synthesis units and prosodic information for Chinese concatenative synthesis
Wu, CH
Chen, JH
[J]. SPEECH COMMUNICATION, 2001, 35 (3-4) : 219 - 237
[9] Automatic Prosody Generation for Serbo-Croatian Speech Synthesis Based on Regression Trees
Secujski, Milan
Pekar, Darko
Jakovljevic, Niksa
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3164 - +
[10] Reconsidering Read and Spontaneous Speech: Causal Perspectives on the Generation of Training Data for Automatic Speech Recognition
Gabler, Philipp
Geiger, Bernhard C.
Schuppler, Barbara
Kern, Roman
[J]. INFORMATION, 2023, 14 (02)

← 1 2 3 4 5 →