Selective training for hidden Markov models with applications to speech classification

Cited by: 37
Authors
Arslan, LM [1]
Hansen, JHL [1]
Affiliation
[1] Duke Univ, Dept Elect Engn, Robust Speech Proc Lab, Durham, NC 27708 USA
Source
Keywords
classifier training; hidden Markov models; pattern classification; speech recognition
DOI
10.1109/89.736330
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Traditional maximum likelihood estimation of hidden Markov model parameters aims at maximizing the overall probability across the training tokens of a given speech unit. As such, it disregards any interaction or biases across the models in the training procedure. Often, the resulting model parameters do not yield minimum-error classification on the training set. A new selective training method is proposed that controls the influence of outliers in the training data on the generated models. The resulting models are shown to possess feature statistics that are more clearly separated for confusable patterns. The proposed selective training procedure is used for hidden Markov model training, with application to foreign accent classification, language identification, and speech recognition using the E-set alphabet. The resulting error rates are measurably improved over traditional forward-backward training under open test conditions. The proposed method is similar in its goal to maximum mutual information estimation training; however, it requires less computation, and the convergence properties of maximum likelihood estimation are retained in the new formulation.
Pages: 46-54
Number of pages: 9