An alternative approach of finding competing hypotheses for better minimum classification error training

被引:0
|
作者
Tam, YC [1 ]
Mak, B [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
During minimum-classification-error (MCE) training, competing hypotheses against the correct one are commonly derived by the N-best algorithm. One problem with the N-best algorithm is that, in practice, some misclassified data can have very large misclassification distances from the N-best competitors and fall out of the steep/trainable region of the sigmoid function, and thus cannot be utilized effectively. Although one may alleviate the problem by adjusting the shape of the sigmoid and then using an appropriate learning rate, it requires careful tuning of these training parameters. In this paper, we propose using the nearest competing hypothesis instead of the traditional N-best hypotheses for MCE training. The aim is to keep the training data as close to the trainable region as possible. Consequently, the amount of "effective" training data is increased. Furthermore, by progressively beating the nearest competitors, the training seems to be more stable. We also design an approximation algorithm based on beam search to locate the nearest competing hypothesis efficiently. We compare the performance of MCE training using 1-nearest or 1-best competing hypotheses on the Aurora database and find that the new approach (using 1-nearest hypotheses) reduces the word error rates by 5.1% and 17.8% over the latter (of using the 1-best competing hypotheses) and the official Aurora baseline respectively.
引用
收藏
页码:101 / 104
页数:4
相关论文
共 50 条
  • [1] Minimum classification error training for handwritten character recognition
    Zhang, R
    Ding, XQ
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL I, PROCEEDINGS, 2002, : 580 - 583
  • [2] Speaker identification using Minimum Classification Error training
    Siohan, O
    Rosenberg, AE
    Parthasarathy, S
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 109 - 112
  • [3] Minimum classification error interactive training for speaker identification
    Kida, Y
    Yamamoto, H
    Miyajima, C
    Tokuda, K
    Kitamura, T
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 641 - 644
  • [4] Minimum classification error/eigenvoices training for speaker identification
    Valente, F
    Wellekens, C
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 213 - 216
  • [5] Soft GPD for minimum classification error rate training
    Shi, BE
    Yao, KS
    Cao, ZG
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1253 - 1256
  • [6] Using minimum classification error training in dimensionality reduction
    Wang, XC
    Paliwal, KK
    [J]. NEURAL NETWORKS FOR SIGNAL PROCESSING X, VOLS 1 AND 2, PROCEEDINGS, 2000, : 338 - 345
  • [7] Minimum classification error training for handwritten character recognition
    Rui, Zhang
    Xiaoqing, Ding
    [J]. Proceedings - International Conference on Pattern Recognition, 2002, 16 (01): : 580 - 583
  • [8] Minimum classification error training for online handwriting recognition
    Biem, A
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (07) : 1041 - 1051
  • [9] Experimental Evaluation of Kernel Minimum Classification Error Training
    Tanaka, Hideaki
    Watanabe, Hideyuki
    Katagiri, Shigeru
    Ohsaki, Miho
    [J]. TENCON 2012 - 2012 IEEE REGION 10 CONFERENCE: SUSTAINABLE DEVELOPMENT THROUGH HUMANITARIAN TECHNOLOGY, 2012,
  • [10] An environment-compensated minimum classification error training approach based on stochastic vector mapping
    Wu, Jian
    Huo, Qiang
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 2147 - 2155