Convolutional density estimation in hidden Markov models for speech recognition

被引:3
|
作者
Matsoukas, S [1 ]
Zavaliagkos, G [1 ]
机构
[1] BBN Syst & Technol Corp, GTE Internetworking, Cambridge, MA 02138 USA
关键词
D O I
10.1109/ICASSP.1999.758075
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In continuous density Hidden Markov Models (HMMs) for speech recognition, the probability density function (pdf) for each state is usually expressed as a mixture of Gaussians. In this paper, we present a model in which the pdf is expressed as the convolution of two densities. We focus on the special case where one of the convolved densities is a M-Gaussian mixture, and the other is a mixture of N impulses. We present the reestimation formulae for the parameters of the M x N convolutional model, and suggest two ways for initializing them, the residual K-Means approach, and the deconvolution from a standard HMM with MN Gaussians per state using a genetic algorithm to search for the optimal assignment of Gaussians. Both methods result in a compact representation that requires only O(M + N) storage space for the model parameters, and O(MN) time for training and decoding. We explain how the decoding time can be reduced to O(M + kN), where k < M. Finally results are shown on the 1996 Hub-rf Development test, demonstrating that a 32 x 2 convolutional model can achieve performance comparable to that of a standard 64-Gaussian per state model.
引用
收藏
页码:113 / 116
页数:4
相关论文
共 50 条
  • [1] Convolutional density estimation in hidden Markov models for speech recognition
    Matsoukas, Spyros
    Zavaliagkos, George
    [J]. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 113 - 116
  • [2] HIDDEN MARKOV MODELS IN SPEECH RECOGNITION
    Krajcovic, J.
    Hrncar, M.
    Muzikarova, E.
    [J]. ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2008, 7 (1-2) : 250 - 252
  • [3] Noisy Hidden Markov Models for Speech Recognition
    Audhkhasi, Kartik
    Osoba, Osonde
    Kosko, Bart
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [4] The Application of Hidden Markov Models in Speech Recognition
    Gales, Mark
    Young, Steve
    [J]. FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2007, 1 (03): : 195 - 304
  • [5] Hidden Markov models for speech and signal recognition
    Rose, RC
    Juang, BH
    [J]. CONTINUOUS WAVE-FORM ANALYSIS, 1996, (45): : 137 - 152
  • [6] HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION
    JUANG, BH
    RABINER, LR
    [J]. TECHNOMETRICS, 1991, 33 (03) : 251 - 272
  • [7] AUTOMATIC SPEECH RECOGNITION USING TIED DENSITY HIDDEN MARKOV-MODELS
    EULER, S
    [J]. FREQUENZ, 1992, 46 (11-12) : 274 - 279
  • [8] Boosted Large-Margin Estimation of Hidden Markov Models for Speech Recognition
    Xu Shuangyin
    Dan, Qu
    [J]. PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 545 - 548
  • [9] Graphical Models for Discrete Hidden Markov Models in Speech Recognition
    Miguel, Antonio
    Ortega, Alfonso
    Buera, Luis
    Lleida, Eduardo
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1387 - 1390
  • [10] Automatic speech recognition using hidden Markov models
    Botros, N.M.
    Teh, C.K.
    [J]. Microcomputer Applications, 1994, 13 (01): : 6 - 12