Efficient speech recognition using subvector quantization and discrete-mixture HMMs

Cited by: 20

Authors:

Digalakis, V [1]

Tsakalidis, S

Harizakis, C

Neumeyer, L

Affiliations:

[1] Tech Univ Crete, Dept Elect & Comp Engn, Hania 73100, Greece

[2] SRI Int, Menlo Park, CA 94025 USA

Source:

COMPUTER SPEECH AND LANGUAGE | 2000 / Vol. 14 / Issue 01

Keywords:

DOI:

10.1006/csla.1999.0134

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper introduces a new form of observation distributions for hidden Markov models (HMMs), combining subvector quantization and mixtures of discrete distributions. Despite what is generally believed, we show that discrete-distribution HMMs can outperform continuous-density HMMs at significantly faster decoding speeds. Performance of the discrete HMMs is improved by using product-code vector quantization (VQ) and mixtures of discrete distributions. The decoding speed of the discrete HMMs is also improved by quantizing subvectors of coefficients, since this reduces the number of table lookups needed to compute the output probabilities. We present efficient training and decoding algorithms for the discrete-mixture HMMs (DMHMMs). Our experimental results in the air-travel information domain show that the high level of recognition accuracy of continuous-mixture-density HMMs (CDHMMs) can be maintained at significantly faster decoding speeds. Moreover, we show that when the same number of mixture components is used in DMHMMs and CDHMMs, the new models exhibit superior recognition performance. © 2000 Academic Press.
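
The abstract does not give the form of the output distribution, so the following is only an illustrative sketch of how a discrete-mixture output probability could be evaluated with table lookups. It assumes that each feature vector is split into subvectors, that each subvector is quantized against its own codebook, and that within a mixture component the subvector codewords are treated as independent, so a component's probability is a product of one table entry per subvector. The function names, the data layout (`tables[m][l][k]`), and the toy numbers are all hypothetical and are not taken from the paper.

```python
import math


def quantize(subvector, codebook):
    """Return the index of the nearest codeword (squared Euclidean distance)."""
    best_index, best_dist = 0, math.inf
    for index, codeword in enumerate(codebook):
        dist = sum((a - b) ** 2 for a, b in zip(subvector, codeword))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def split_subvectors(vector, sizes):
    """Split a feature vector into consecutive subvectors of the given sizes."""
    parts, start = [], 0
    for size in sizes:
        parts.append(vector[start:start + size])
        start += size
    return parts


def dmhmm_output_prob(vector, sizes, codebooks, weights, tables):
    """Mixture of discrete distributions over subvector codeword indices.

    tables[m][l][k] is the (assumed) probability of codeword k of subvector l
    under mixture component m; weights[m] is the weight of component m.
    """
    # One quantization per subvector, shared by all mixture components.
    subvectors = split_subvectors(vector, sizes)
    indices = [quantize(sub, cb) for sub, cb in zip(subvectors, codebooks)]

    total = 0.0
    for weight, component in zip(weights, tables):
        prob = weight
        # One table lookup per subvector instead of one per coefficient.
        for l, k in enumerate(indices):
            prob *= component[l][k]
        total += prob
    return total


# Toy setup: a 3-dimensional vector split into subvectors of size 2 and 1.
sizes = [2, 1]
codebooks = [
    [[0.0, 0.0], [1.0, 1.0]],  # codebook for subvector 0
    [[0.0], [2.0]],            # codebook for subvector 1
]
weights = [0.5, 0.5]
tables = [
    [[0.9, 0.1], [0.8, 0.2]],  # component 0
    [[0.2, 0.8], [0.5, 0.5]],  # component 1
]

prob = dmhmm_output_prob([0.9, 1.2, 1.7], sizes, codebooks, weights, tables)
print(prob)
```

In this toy setup both subvectors quantize to codeword index 1, so the result is 0.5 * 0.1 * 0.2 + 0.5 * 0.8 * 0.5. Grouping coefficients into subvectors means the inner loop runs once per subvector rather than once per coefficient, which is the reduction in table lookups the abstract refers to.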


Pages: 33-46

Page count: 14
