PROTOTYPE-BASED MINIMUM CLASSIFICATION ERROR GENERALIZED PROBABILISTIC DESCENT TRAINING FOR VARIOUS SPEECH UNITS

被引：32

作者：

MCDERMOTT, E

KATAGIRI, S

机构：

[1] ATR Human Information Processing Research Laboratories, Kyoto, 619-02, Hikari-dai 2-2, Seika-cho

来源：

COMPUTER SPEECH AND LANGUAGE | 1994年 / 8卷 / 04期

关键词：

D O I：

10.1006/csla.1994.1018

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In previous work we reported high classification rates for learning vector quantization (LVQ) networks trained to classify phoneme tokens shifted in time. It has since been shown that the framework of minimum classification error (MCE) and generalized probabilistic descent (GPD) can treat LVQ as a special case of a general method for gradient descent on a rigorously defined classification loss measure that closely reflects the misclassification rate. This framework allows us to extend LVQ into a prototype-based minimum error classifier (PBMEC) appropriate for the classification of various speech units which the original LVQ was unable to treat. Speech categories are represented using a prototype-based multi-state architecture incorporating a dynamic time warping procedure. We present results for the difficult E-set task, as well as for isolated word recognition for a vocabulary of 5240 words, that reveal clear gains in performance as a result of using PBMEC. In addition, we discuss the issue of smoothing the loss function from the perspective of increasing classifier robustness.

引用

页码：351 / 368

页数：18

共 47 条

[41] Novel spotting-based approach to continuous speech recognition: minimum error classification of keyword-sequences
ATR Human Information Processing, Research Lab, Kyoto, Japan
Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 1995, 16 (03): : 147 - 157
[42] Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis
Wu, Yi-Jian
Tokuda, Keiichi
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 577 - 580
[43] SAMPLE-SEPARATION-MARGIN BASED MINIMUM CLASSIFICATION ERROR TRAINING OF PATTERN CLASSIFIERS WITH QUADRATIC DISCRIMINANT FUNCTIONS
Wang, Yongqiang
Huo, Qiang
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1866 - 1869
[44] Improving Trajectory Modelling for DNN-Based Speech Synthesis by Using Stacked Bottleneck Features and Minimum Generation Error Training
Wu, Zhizheng
King, Simon
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (07) : 1255 - 1265
[45] Simultaneous ANN feature and HMM recognizer design using string-based minimum classification error (MCE) training
Rahim, MG
Lee, CH
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1824 - 1827
[46] Discriminative State-Dependent Weightings of Duration-Based State Transition Model through Minimum Classification Error Training
Kato, Yoshinaga
Muroi, Tetsuya
1600, John Wiley and Sons Inc. (31):
[47] Discriminative training based on minimum classification error for a small amount of data enhanced by vector-field-smoothed Bayesian learning
Takahashi, J
Sagayama, S
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1996, E79D (12) : 1700 - 1707

← 1 2 3 4 5 →