PROTOTYPE-BASED MINIMUM CLASSIFICATION ERROR GENERALIZED PROBABILISTIC DESCENT TRAINING FOR VARIOUS SPEECH UNITS

Cited by: 32
Authors
MCDERMOTT, E
KATAGIRI, S
Affiliation
[1] ATR Human Information Processing Research Laboratories, Kyoto, 619-02, Hikari-dai 2-2, Seika-cho
Source
COMPUTER SPEECH AND LANGUAGE | 1994 / Vol. 8 / No. 04
DOI
10.1006/csla.1994.1018
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In previous work we reported high classification rates for learning vector quantization (LVQ) networks trained to classify phoneme tokens shifted in time. It has since been shown that the framework of minimum classification error (MCE) and generalized probabilistic descent (GPD) can treat LVQ as a special case of a general method for gradient descent on a rigorously defined classification loss measure that closely reflects the misclassification rate. This framework allows us to extend LVQ into a prototype-based minimum error classifier (PBMEC) appropriate for the classification of various speech units which the original LVQ was unable to treat. Speech categories are represented using a prototype-based multi-state architecture incorporating a dynamic time warping procedure. We present results for the difficult E-set task, as well as for isolated word recognition for a vocabulary of 5240 words, that reveal clear gains in performance as a result of using PBMEC. In addition, we discuss the issue of smoothing the loss function from the perspective of increasing classifier robustness.
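The abstract describes gradient descent on a smoothed classification loss over prototypes. The following is a minimal sketch of that general idea, not the authors' implementation: it assumes a single-state classifier with squared Euclidean distances, a misclassification measure built from only the nearest correct and nearest competing prototype, and a sigmoid loss. The multi-state architecture and dynamic time warping mentioned in the abstract are omitted, and the names `sq_dist`, `mce_step`, `alpha`, and `lr` are hypothetical.

```python
import math

def sq_dist(x, p):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(x, p))

def mce_step(x, label, prototypes, alpha=1.0, lr=0.1):
    """One gradient step on a sigmoid-smoothed classification loss (illustrative).

    prototypes: list of (class_label, vector) pairs; vectors are lists and
    are updated in place. Returns the loss value computed before the update.
    """
    # Nearest prototype of the correct class and of any competing class.
    correct = min((p for c, p in prototypes if c == label),
                  key=lambda p: sq_dist(x, p))
    rival = min((p for c, p in prototypes if c != label),
                key=lambda p: sq_dist(x, p))
    # Misclassification measure: positive means the token is misclassified.
    d = sq_dist(x, correct) - sq_dist(x, rival)
    # Sigmoid loss: a smooth stand-in for the 0/1 error count.
    loss = 1.0 / (1.0 + math.exp(-alpha * d))
    # Chain rule: d(loss)/d(d) = alpha * loss * (1 - loss).
    g = alpha * loss * (1.0 - loss)
    for i, xi in enumerate(x):
        ci, ri = correct[i], rival[i]
        correct[i] = ci + lr * g * 2.0 * (xi - ci)  # pull correct prototype toward x
        rival[i] = ri - lr * g * 2.0 * (xi - ri)    # push rival prototype away from x
    return loss

# Usage: a token closer to the wrong prototype; repeated steps lower the loss.
protos = [("a", [0.0, 0.0]), ("b", [1.0, 0.0])]
token = [0.6, 0.0]
first = mce_step(token, "a", protos)
second = mce_step(token, "a", protos)
```

In this sketch the factor `g` is largest for tokens near the decision boundary, so those tokens drive most of the update; a smaller `alpha` widens the region of tokens that contribute, which is one way to read the abstract's remark about smoothing the loss function.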
Pages: 351 - 368
Page count: 18
Related papers
47 in total
  • [1] PROTOTYPE-BASED MINIMUM ERROR TRAINING FOR SPEECH RECOGNITION
    MCDERMOTT, E
    KATAGIRI, S
    APPLIED INTELLIGENCE, 1994, 4 (03) : 245 - 256
  • [2] A Study of a New Misclassification Measure for Minimum Classification Error Training of Prototype-based Pattern Classifiers
    He, Tingting
    Huo, Qiang
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2523 - 2526
  • [3] Prototype-based minimum error classifier for handwritten digits recognition
    Nopsuwanchai, R
    Biem, A
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, : 845 - 848
  • [4] A telephone-based directory assistance system adaptively trained using minimum classification error generalized probabilistic descent
    McDermott, E
    Woudenberg, EA
    Katagiri, S
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3346 - 3349
  • [5] Prototype-based classification and error analysis under bootstrapping strategy
    Hwang, Doosung
    Son, Youngju
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2018, 10 (04) : 293 - 313
  • [6] Minimum word classification error training of HMMS for automatic speech recognition
    Yan, Zhi-Jie
    Zhu, Bo
    Hu, Yu
    Wang, Ren-Hua
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4521 - 4524
  • [7] Speech Pattern Classification Using Large Geometric Margin Minimum Classification Error Training
    Kitaoka, Mikiyo
    Hashimoto, Tetsuya
    Ochiai, Tsubasa
    Katagiri, Shigeru
    Ohsaki, Miho
    Watanabe, Hideyuki
    Lu, Xugang
    Kawai, Hisashi
    TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
  • [8] A self-training hierarchical prototype-based approach for semi-supervised classification
    Gu, Xiaowei
    INFORMATION SCIENCES, 2020, 535 : 204 - 224
  • [9] Generalization of the Minimum Classification Error (MCE) Training Based on Maximizing Generalized Posterior Probability (GPP)
    Fu, Qiang
    Moreno-Daniel, Antonio
    Juang, Biing-Hwang
    Zhou, Jian-Lai
    Soong, Frank
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 681 - +
  • [10] Audio-visual speech recognition using minimum classification error training
    Miyajima, C
    Tokuda, K
    Kitamura, T
    NEURAL NETWORKS FOR SIGNAL PROCESSING X, VOLS 1 AND 2, PROCEEDINGS, 2000, : 3 - 12