Automatic speech recognition based on weighted minimum classification error (W-MCE) training method

Cited by: 3
Authors
Fu, Qiang [1]
Juang, Biing-Hwang [1]
Affiliation
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
Keywords
non-uniform error cost; weighted MCE
DOI
10.1109/ASRU.2007.4430124
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Bayes decision theory [1] is the foundation of the classical statistical approach to pattern recognition. For most pattern recognition problems, Bayes decision theory is applied under the assumption that system performance is measured by simple error counting, which assigns an identical cost to every recognition error. However, this prevalent performance metric is undesirable in many practical applications. For example, keyword spotting systems require the cost of each "recognition" error to be differentiated. In this paper, we propose an extended framework for the speech recognition problem with non-uniform classification/recognition error cost. As the system performance metric, the recognition error is weighted according to the task objective. Bayes decision theory is applied under this performance metric, and the decision rule with a non-uniform error cost function is derived. We argue that the minimum classification error (MCE) method, after appropriate generalization, is the most suitable training algorithm for designing the "optimal" classifier that minimizes the weighted error rate. We formulate the weighted MCE (W-MCE) algorithm on top of the conventional MCE infrastructure by integrating the error cost and the recognition error count into one objective function. In the context of automatic speech recognition (ASR), we present a variety of training scenarios and weighting strategies under this extended framework. An experimental demonstration on large vocabulary continuous speech recognition is provided to support the effectiveness of our approach.
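The contrast between simple error counting and a non-uniform error cost can be illustrated with the textbook minimum-risk form of the Bayes decision rule. The sketch below is not the paper's W-MCE training algorithm; it only shows how a cost matrix changes the decision. The function name `min_risk_decision`, the two-class "filler vs. keyword" setup, the posterior values, and the cost of 5 for a missed keyword are all illustrative assumptions, not values from the paper.

```python
def min_risk_decision(posteriors, cost):
    """Pick the class with the lowest expected cost (Bayes minimum-risk rule).

    posteriors[i] : P(class i | observation)
    cost[i][j]    : cost of deciding class j when the true class is i
    Both inputs are hypothetical illustration values, not data from the paper.
    """
    n = len(posteriors)
    # Conditional risk of deciding j: sum over true classes i of cost * posterior.
    risks = [sum(cost[i][j] * posteriors[i] for i in range(n)) for j in range(n)]
    best = min(range(n), key=lambda j: risks[j])
    return best, risks


# Assumed toy setup: class 0 = filler speech, class 1 = keyword.
posteriors = [0.6, 0.4]

# Simple error counting: every error costs 1 (reduces to the MAP decision).
uniform_cost = [[0, 1],
                [1, 0]]

# Non-uniform cost: missing a keyword (true 1, decided 0) is assumed 5x worse.
weighted_cost = [[0, 1],
                 [5, 0]]

uniform_choice, uniform_risks = min_risk_decision(posteriors, uniform_cost)
weighted_choice, weighted_risks = min_risk_decision(posteriors, weighted_cost)

print("uniform cost  ->", uniform_choice, uniform_risks)
print("weighted cost ->", weighted_choice, weighted_risks)
```

With identical costs the rule picks the class with the larger posterior; once a missed keyword is made more expensive, the same posteriors lead to the keyword decision. This is the kind of task-dependent weighting the abstract motivates, shown here only at decision time rather than in training.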
Pages: 278-283
Page count: 6
Related papers
50 in total
  • [21] A minimum classification error method for face recognition
    Chen, LH
    Chen, JR
    Liang, D
    Deng, SH
    Liao, HY
    SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND ITS APPLICATIONS, 1999, (465): 630-633
  • [22] Minimum classification error training of landmark models for real-time continuous speech recognition
    McDermott, E
    Hazen, TJ
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004: 937-940
  • [23] Modelling uncertainty in stochastic vector mapping with minimum classification error training for robust speech recognition
    Wu, J
    Huo, Q
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003: 97-100
  • [24] Minimum classification error for large scale speech recognition tasks using weighted finite state transducers
    McDermott, E
    Katagiri, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005: 113-116
  • [25] A genetic classification error method for speech recognition
    Kwong, S
    He, QH
    Ku, KW
    Chan, TM
    Man, KF
    Tang, KS
    SIGNAL PROCESSING, 2002, 82 (05): 737-748
  • [26] MINIMUM CLASSIFICATION ERROR TRAINING WITH AUTOMATIC SETTING OF LOSS SMOOTHNESS
    Watanabe, Hideyuki
    Tokuno, Jun'ichi
    Ohashi, Tsukasa
    Katagiri, Shigeru
    Ohsaki, Miho
    2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011
  • [27] Automatic Loss Smoothness Determination for Minimum Classification Error Training
    Tokuno, Jun'ichi
    Ohashi, Tsukasa
    Watanabe, Hideyuki
    Katagiri, Shigeru
    Ohsaki, Miho
    2011 IEEE REGION 10 CONFERENCE TENCON 2011, 2011: 69-73
  • [28] Large-margin minimum classification error training for large-scale speech recognition tasks
    Yu, Dong
    Deng, Li
    He, Xiaodong
    Acero, Alex
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007: 1137-+
  • [29] Minimum classification error training for online handwritten word recognition
    Biem, A
    EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002: 61-66
  • [30] Simultaneous ANN feature and HMM recognizer design using string-based minimum classification error (MCE) training
    Rahim, MG
    Lee, CH
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996: 1824-1827