Convolutional density estimation in hidden Markov models for speech recognition

被引：3

作者：

Matsoukas, S ^{[1
]}

Zavaliagkos, G ^{[1
]}

机构：

[1] BBN Syst & Technol Corp, GTE Internetworking, Cambridge, MA 02138 USA

来源：

ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI | 1999年

关键词：

D O I：

10.1109/ICASSP.1999.758075

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In continuous density Hidden Markov Models (HMMs) for speech recognition, the probability density function (pdf) for each state is usually expressed as a mixture of Gaussians. In this paper, we present a model in which the pdf is expressed as the convolution of two densities. We focus on the special case where one of the convolved densities is a M-Gaussian mixture, and the other is a mixture of N impulses. We present the reestimation formulae for the parameters of the M x N convolutional model, and suggest two ways for initializing them, the residual K-Means approach, and the deconvolution from a standard HMM with MN Gaussians per state using a genetic algorithm to search for the optimal assignment of Gaussians. Both methods result in a compact representation that requires only O(M + N) storage space for the model parameters, and O(MN) time for training and decoding. We explain how the decoding time can be reduced to O(M + kN), where k < M. Finally results are shown on the 1996 Hub-rf Development test, demonstrating that a 32 x 2 convolutional model can achieve performance comparable to that of a standard 64-Gaussian per state model.

引用

页码：113 / 116

页数：4

共 50 条

[1] Convolutional density estimation in hidden Markov models for speech recognition
Matsoukas, Spyros
Zavaliagkos, George
[J]. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 113 - 116
[2] HIDDEN MARKOV MODELS IN SPEECH RECOGNITION
Krajcovic, J.
Hrncar, M.
Muzikarova, E.
[J]. ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2008, 7 (1-2) : 250 - 252
[3] Noisy Hidden Markov Models for Speech Recognition
Audhkhasi, Kartik
Osoba, Osonde
Kosko, Bart
[J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
[4] The Application of Hidden Markov Models in Speech Recognition
Gales, Mark
Young, Steve
[J]. FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2007, 1 (03): : 195 - 304
[5] Hidden Markov models for speech and signal recognition
Rose, RC
Juang, BH
[J]. CONTINUOUS WAVE-FORM ANALYSIS, 1996, (45): : 137 - 152
[6] HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION
JUANG, BH
RABINER, LR
[J]. TECHNOMETRICS, 1991, 33 (03) : 251 - 272
[7] AUTOMATIC SPEECH RECOGNITION USING TIED DENSITY HIDDEN MARKOV-MODELS
EULER, S
[J]. FREQUENZ, 1992, 46 (11-12) : 274 - 279
[8] Boosted Large-Margin Estimation of Hidden Markov Models for Speech Recognition
Xu Shuangyin
Dan, Qu
[J]. PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 545 - 548
[9] Graphical Models for Discrete Hidden Markov Models in Speech Recognition
Miguel, Antonio
Ortega, Alfonso
Buera, Luis
Lleida, Eduardo
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1387 - 1390
[10] Automatic speech recognition using hidden Markov models
Botros, N.M.
Teh, C.K.
[J]. Microcomputer Applications, 1994, 13 (01): : 6 - 12

← 1 2 3 4 5 →