An alternative approach of finding competing hypotheses for better minimum classification error training

被引：0

作者：

Tam, YC ^{[1
]}

Mak, B ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China

来源：

2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS | 2002年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

During minimum-classification-error (MCE) training, competing hypotheses against the correct one are commonly derived by the N-best algorithm. One problem with the N-best algorithm is that, in practice, some misclassified data can have very large misclassification distances from the N-best competitors and fall out of the steep/trainable region of the sigmoid function, and thus cannot be utilized effectively. Although one may alleviate the problem by adjusting the shape of the sigmoid and then using an appropriate learning rate, it requires careful tuning of these training parameters. In this paper, we propose using the nearest competing hypothesis instead of the traditional N-best hypotheses for MCE training. The aim is to keep the training data as close to the trainable region as possible. Consequently, the amount of "effective" training data is increased. Furthermore, by progressively beating the nearest competitors, the training seems to be more stable. We also design an approximation algorithm based on beam search to locate the nearest competing hypothesis efficiently. We compare the performance of MCE training using 1-nearest or 1-best competing hypotheses on the Aurora database and find that the new approach (using 1-nearest hypotheses) reduces the word error rates by 5.1% and 17.8% over the latter (of using the 1-best competing hypotheses) and the official Aurora baseline respectively.

引用

页码：101 / 104

页数：4

共 50 条

[1] Minimum classification error training for handwritten character recognition
Zhang, R
Ding, XQ
[J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL I, PROCEEDINGS, 2002, : 580 - 583
[2] Speaker identification using Minimum Classification Error training
Siohan, O
Rosenberg, AE
Parthasarathy, S
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 109 - 112
[3] Minimum classification error interactive training for speaker identification
Kida, Y
Yamamoto, H
Miyajima, C
Tokuda, K
Kitamura, T
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 641 - 644
[4] Minimum classification error/eigenvoices training for speaker identification
Valente, F
Wellekens, C
[J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 213 - 216
[5] Soft GPD for minimum classification error rate training
Shi, BE
Yao, KS
Cao, ZG
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1253 - 1256
[6] Using minimum classification error training in dimensionality reduction
Wang, XC
Paliwal, KK
[J]. NEURAL NETWORKS FOR SIGNAL PROCESSING X, VOLS 1 AND 2, PROCEEDINGS, 2000, : 338 - 345
[7] Minimum classification error training for handwritten character recognition
Rui, Zhang
Xiaoqing, Ding
[J]. Proceedings - International Conference on Pattern Recognition, 2002, 16 (01): : 580 - 583
[8] Minimum classification error training for online handwriting recognition
Biem, A
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (07) : 1041 - 1051
[9] Experimental Evaluation of Kernel Minimum Classification Error Training
Tanaka, Hideaki
Watanabe, Hideyuki
Katagiri, Shigeru
Ohsaki, Miho
[J]. TENCON 2012 - 2012 IEEE REGION 10 CONFERENCE: SUSTAINABLE DEVELOPMENT THROUGH HUMANITARIAN TECHNOLOGY, 2012,
[10] An environment-compensated minimum classification error training approach based on stochastic vector mapping
Wu, Jian
Huo, Qiang
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 2147 - 2155

← 1 2 3 4 5 →