PROTOTYPE-BASED MINIMUM CLASSIFICATION ERROR GENERALIZED PROBABILISTIC DESCENT TRAINING FOR VARIOUS SPEECH UNITS

被引:32
|
作者
MCDERMOTT, E
KATAGIRI, S
机构
[1] ATR Human Information Processing Research Laboratories, Kyoto, 619-02, Hikari-dai 2-2, Seika-cho
来源
COMPUTER SPEECH AND LANGUAGE | 1994年 / 8卷 / 04期
关键词
D O I
10.1006/csla.1994.1018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In previous work we reported high classification rates for learning vector quantization (LVQ) networks trained to classify phoneme tokens shifted in time. It has since been shown that the framework of minimum classification error (MCE) and generalized probabilistic descent (GPD) can treat LVQ as a special case of a general method for gradient descent on a rigorously defined classification loss measure that closely reflects the misclassification rate. This framework allows us to extend LVQ into a prototype-based minimum error classifier (PBMEC) appropriate for the classification of various speech units which the original LVQ was unable to treat. Speech categories are represented using a prototype-based multi-state architecture incorporating a dynamic time warping procedure. We present results for the difficult E-set task, as well as for isolated word recognition for a vocabulary of 5240 words, that reveal clear gains in performance as a result of using PBMEC. In addition, we discuss the issue of smoothing the loss function from the perspective of increasing classifier robustness.
引用
收藏
页码:351 / 368
页数:18
相关论文
共 47 条