Minimum segmentation error based discriminative training for speech synthesis application

被引:0
|
作者
Wu, YJ
Kawai, H
Ni, JF
Wang, RH
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the conventional HMM-based segmentation method, the HMM training is based on MLE criteria, which links the segmentation task to the problem of distribution estimation. The HMMs are built to identify the phonetic segments, not to detect the boundary. This kind of inconsistency between training and application limited the performance of segmentation. In this paper, we adopt the discriminative training method and introduce a new criterion, named Minimum Segmentation Error (MSGE), for HMM training. In this method, a loss function directly related to the segmentation error is defined. By minimizing the overall empirical loss with the Generalized Probabilistic Descent (GPD) algorithm, the segmentation error is also minimized. From the results on both Chinese and Japanese data, the accuracy of segmentation is improved. Moreover, this method is robust even when we do not have enough knowledge on HMM modeling, e.g. the number of states is not optimized.
引用
收藏
页码:629 / 632
页数:4
相关论文
共 50 条
  • [31] Sample Training Based Wildfire Segmentation by 2D Histogram θ-Division with Minimum Error
    Zhao, Jianhui
    Dong, Erqian
    Sun, Mingui
    Jia, Wenyan
    Zhang, Dengyi
    Yuan, Zhiyong
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [32] String-based minimum verification error (SB-MVE) training for speech recognition
    AT&T Lab-Research, Murray Hill, United States
    Comput Speech Lang, 2 (147-160):
  • [33] String-based minimum verification error (SB-MVE) training for speech recognition
    Rahim, MG
    Lee, CH
    COMPUTER SPEECH AND LANGUAGE, 1997, 11 (02): : 147 - 160
  • [34] An improved minimum generation error based model adaptation for HMM-based speech synthesis
    Wu, Yi-Jian
    Qin, Long
    Tokuda, Keiichi
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1727 - +
  • [35] Minimum word classification error training of HMMS for automatic speech recognition
    Yan, Zhi-Jie
    Zhu, Bo
    Hu, Yu
    Wang, Ren-Hua
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4521 - 4524
  • [36] DISCRIMINATIVE LINEAR TRANSFORM BASED ADAPTATION USING MINIMUM VERIFICATION ERROR
    Shin, Sunghwan
    Jung, Ho-Young
    Kim, Tae-Yoon
    Juang, Biing-Hwang
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4318 - 4321
  • [37] Automatic speech recognition based on weighted minimum classification error (W-MCE) training method
    Fu, Qiang
    Juang, Biing-Hwang
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 278 - 283
  • [38] Minimum classification error training in example based speech and pattern recognition using sparse weight matrices
    Matton, Mike
    Van Compernolle, Dirk
    Cools, Ronald
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2010, 234 (04) : 1303 - 1311
  • [39] Minimum generation error linear regression based model adaptation for HMM-based speech synthesis
    Qin, Long
    Wu, Yi-Jian
    Ling, Zhen-Hua
    Wang, Ren-Hua
    Da, Li-Rong
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3953 - +
  • [40] PROTOTYPE-BASED MINIMUM CLASSIFICATION ERROR GENERALIZED PROBABILISTIC DESCENT TRAINING FOR VARIOUS SPEECH UNITS
    MCDERMOTT, E
    KATAGIRI, S
    COMPUTER SPEECH AND LANGUAGE, 1994, 8 (04): : 351 - 368