Automatic generation of speech synthesis units based on closed loop training

被引:0
|
作者
Kagoshima, T
Akamine, M
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a new method for automatically generating speech synthesis units. A small set of synthesis units is selected from a large speech database by the proposed Closed-Loop Training method (CLT). Because CLT is based on the evaluation and minimization of the distortion caused by the synthesis process such as prosodic modification ! the selected synthesis units are most suitable for synthesizers. In this paper, CLT is applied to a waveform concatenation based synthesizer, whose basic unit is CV/VC(diphone). It is shown that synthesis units can be efficiently generated by CLT from a labeled speech database with a small amount of computation. Moreover, the synthesized speech is clear and smooth even though the storage size of the waveform dictionary is small.
引用
收藏
页码:963 / 966
页数:4
相关论文
共 50 条
  • [31] Discriminative Training for Automatic Speech Recognition
    Heigold, Georg
    Ney, Hermann
    Schlueter, Ralf
    Wiesler, Simon
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 58 - 69
  • [32] Automatic Speech Recognition and Pronunciation Training
    Xiao, Wenqi
    [J]. PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON EDUCATION, ECONOMICS AND MANAGEMENT RESEARCH (ICEEMR 2018), 2018, 182 : 466 - 468
  • [33] Speech parameter generation algorithms for HMM-based speech synthesis
    Tokuda, K
    Yoshimura, T
    Masuko, T
    Kobayashi, T
    Kitamura, T
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1315 - 1318
  • [34] Automatic capitalisation generation for speech input
    Kim, JH
    Woodland, PC
    [J]. COMPUTER SPEECH AND LANGUAGE, 2004, 18 (01): : 67 - 90
  • [35] MINIMUM GENERATION ERROR TRAINING WITH WEIGHTED EUCLIDEAN DISTANCE ON LSP FOR HMM-BASED SPEECH SYNTHESIS
    Lei, Ming
    Ling, Zhen-Hua
    Dai, Li-Rong
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4230 - 4233
  • [36] Closed-Loop Training for Projected GAN
    Zhao, Jiangwei
    Zhang, Liang
    Pan, Lili
    Li, Hongliang
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 106 - 110
  • [37] Closed loop dynamic bit allocation for excitation parameters in analysis-by-synthesis speech codec
    Ashley, James P.
    Mittal, Udar
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1109 - +
  • [38] DSP Processer-in-the-Loop Tests Based on Automatic Code Generation
    Zhang, Qi
    Pei, Wenhui
    [J]. INVENTIONS, 2022, 7 (01)
  • [39] TOWARDS CLOSED-LOOP SPEECH SYNTHESIS FROM STEREOTACTIC EEG: A UNIT SELECTION APPROACH
    Angrick, Miguel
    Ottenhoff, Maarten
    Diener, Lorenz
    Ivucic, Darius
    Ivucic, Gabriel
    Goulis, Sophocles
    Colon, Albert J.
    Wagner, Louis
    Krusienski, Dean J.
    Kubben, Pieter L.
    Schultz, Tanja
    Herff, Christian
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1296 - 1300
  • [40] Closed loop control structures automatic set point generation in programming by demonstration for service robotic tasks
    She, Haiying
    Graeser, Axel
    [J]. OPTIM 04: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON OPTIMIZATION OF ELECTRICAL AND ELECTRONIC EQUIPMENT, VOL II: POWER ELECTRONICS, ELECTRICAL MACHINES AND DRIVES, 2004, : 231 - 238