PROTOTYPE-BASED MINIMUM CLASSIFICATION ERROR GENERALIZED PROBABILISTIC DESCENT TRAINING FOR VARIOUS SPEECH UNITS

被引:32
|
作者
MCDERMOTT, E
KATAGIRI, S
机构
[1] ATR Human Information Processing Research Laboratories, Kyoto, 619-02, Hikari-dai 2-2, Seika-cho
来源
COMPUTER SPEECH AND LANGUAGE | 1994年 / 8卷 / 04期
关键词
D O I
10.1006/csla.1994.1018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In previous work we reported high classification rates for learning vector quantization (LVQ) networks trained to classify phoneme tokens shifted in time. It has since been shown that the framework of minimum classification error (MCE) and generalized probabilistic descent (GPD) can treat LVQ as a special case of a general method for gradient descent on a rigorously defined classification loss measure that closely reflects the misclassification rate. This framework allows us to extend LVQ into a prototype-based minimum error classifier (PBMEC) appropriate for the classification of various speech units which the original LVQ was unable to treat. Speech categories are represented using a prototype-based multi-state architecture incorporating a dynamic time warping procedure. We present results for the difficult E-set task, as well as for isolated word recognition for a vocabulary of 5240 words, that reveal clear gains in performance as a result of using PBMEC. In addition, we discuss the issue of smoothing the loss function from the perspective of increasing classifier robustness.
引用
收藏
页码:351 / 368
页数:18
相关论文
共 47 条
  • [21] Minimum Classification Error Training of Hidden Conditional Random Fields for Speech and Speaker Recognition
    Hong, Wei-Tyng
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (04) : 729 - 742
  • [22] Discriminative training for large-vocabulary speech recognition using minimum classification error
    McDermott, Erik
    Hazen, Timothy J.
    Le Roux, Jonathan
    Nakamura, Atsushi
    Katagiri, Shigeru
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 203 - 223
  • [23] Minimum classification error training of landmark models for real-time continuous speech recognition
    McDermott, E
    Hazen, TJ
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 937 - 940
  • [24] Modelling uncertainty in stochastic vector mapping with minimum classification error training for robust speech recognition
    Wu, J
    Huo, Q
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 97 - 100
  • [25] Towards minimum perceptual error training for DNN-based speech synthesis
    Valentini-Botinhao, Cassia
    Wu, Zhizheng
    King, Simon
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 869 - 873
  • [26] MINIMUM PHONE ERROR MODEL TRAINING ON MERGED ACOUSTIC UNITS FOR TRANSCRIBING BILINGUAL CODE-SWITCHED SPEECH
    Yeh, Ching-Feng
    Lin, Yiu-Chang
    Lee, Lin-Shan
    2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 320 - 324
  • [27] Large-margin minimum classification error training for large-scale speech recognition tasks
    Yu, Dong
    Deng, Li
    He, Xiaodong
    Acero, Alex
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1137 - +
  • [28] A frequency-weighted HMM based on minimum error classification for noisy speech recognition
    Matsumoto, H
    Ono, M
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1511 - 1514
  • [29] Combined evolutionary algorithm and minimum classification error training for DHMM based landmine detection
    Zhao, YX
    Chen, P
    Gader, P
    Zhang, Y
    DETECTION AND REMEDIATION TECHNOLOGIES FOR MINES AND MINELIKE TARGETS VII, PTS 1 AND 2, 2002, 4742 : 1038 - 1049
  • [30] Minimum Classification Error Training to Improve Discriminability of PCMM-Based Feature Compensation
    Kim, Wooil
    Ko, Hanseok
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2005, 24 (01): : 58 - 67