Automatic generation of speech synthesis units based on closed loop training

被引：0

作者：

Kagoshima, T

Akamine, M

机构：

来源：

1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS | 1997年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper proposes a new method for automatically generating speech synthesis units. A small set of synthesis units is selected from a large speech database by the proposed Closed-Loop Training method (CLT). Because CLT is based on the evaluation and minimization of the distortion caused by the synthesis process such as prosodic modification ! the selected synthesis units are most suitable for synthesizers. In this paper, CLT is applied to a waveform concatenation based synthesizer, whose basic unit is CV/VC(diphone). It is shown that synthesis units can be efficiently generated by CLT from a labeled speech database with a small amount of computation. Moreover, the synthesized speech is clear and smooth even though the storage size of the waveform dictionary is small.

引用

页码：963 / 966

页数：4

共 50 条

[31] Discriminative Training for Automatic Speech Recognition
Heigold, Georg
Ney, Hermann
Schlueter, Ralf
Wiesler, Simon
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 58 - 69
[32] Automatic Speech Recognition and Pronunciation Training
Xiao, Wenqi
[J]. PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON EDUCATION, ECONOMICS AND MANAGEMENT RESEARCH (ICEEMR 2018), 2018, 182 : 466 - 468
[33] Speech parameter generation algorithms for HMM-based speech synthesis
Tokuda, K
Yoshimura, T
Masuko, T
Kobayashi, T
Kitamura, T
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1315 - 1318
[34] Automatic capitalisation generation for speech input
Kim, JH
Woodland, PC
[J]. COMPUTER SPEECH AND LANGUAGE, 2004, 18 (01): : 67 - 90
[35] MINIMUM GENERATION ERROR TRAINING WITH WEIGHTED EUCLIDEAN DISTANCE ON LSP FOR HMM-BASED SPEECH SYNTHESIS
Lei, Ming
Ling, Zhen-Hua
Dai, Li-Rong
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4230 - 4233
[36] Closed-Loop Training for Projected GAN
Zhao, Jiangwei
Zhang, Liang
Pan, Lili
Li, Hongliang
[J]. IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 106 - 110
[37] Closed loop dynamic bit allocation for excitation parameters in analysis-by-synthesis speech codec
Ashley, James P.
Mittal, Udar
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1109 - +
[38] DSP Processer-in-the-Loop Tests Based on Automatic Code Generation
Zhang, Qi
Pei, Wenhui
[J]. INVENTIONS, 2022, 7 (01)
[39] TOWARDS CLOSED-LOOP SPEECH SYNTHESIS FROM STEREOTACTIC EEG: A UNIT SELECTION APPROACH
Angrick, Miguel
Ottenhoff, Maarten
Diener, Lorenz
Ivucic, Darius
Ivucic, Gabriel
Goulis, Sophocles
Colon, Albert J.
Wagner, Louis
Krusienski, Dean J.
Kubben, Pieter L.
Schultz, Tanja
Herff, Christian
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1296 - 1300
[40] Closed loop control structures automatic set point generation in programming by demonstration for service robotic tasks
She, Haiying
Graeser, Axel
[J]. OPTIM 04: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON OPTIMIZATION OF ELECTRICAL AND ELECTRONIC EQUIPMENT, VOL II: POWER ELECTRONICS, ELECTRICAL MACHINES AND DRIVES, 2004, : 231 - 238

← 1 2 3 4 5 →