Efficient speech recognition using subvector quantization and discrete-mixture HMMS

被引:20
|
作者
Digalakis, V [1 ]
Tsakalidis, S
Harizakis, C
Neumeyer, L
机构
[1] Tech Univ Crete, Dept Elect & Comp Engn, Hania 73100, Greece
[2] SRI Int, Menlo Park, CA 94025 USA
来源
COMPUTER SPEECH AND LANGUAGE | 2000年 / 14卷 / 01期
关键词
D O I
10.1006/csla.1999.0134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a new form of observation distributions for hidden Markov models (HMMs), combining subvector quantization and mixtures of discrete distributions. Despite what is generally believed, we show that discrete-distribution HMMs can outperform continuous-density HMMs at significantly faster decoding speeds. Performance of the discrete HMMs is improved by using product-code vector quantization (VQ) and mixtures of discrete distributions. The decoding speed of the discrete HMMs is also improved by quantizing subvectors of coefficients, since this reduces the number of table lookups needed to compute the output probabilities. We present efficient training and decoding algorithms for the discrete-mixture HMMs (DMHMMs). Our experimental results in the air-travel information domain show that the high level of recognition accuracy of continuous-mixture-density HMMs (CDHMMs) can be maintained at significantly faster decoding speeds. Moreover, we show that when the same number of mixture components is used in DMHMMs and CDHMMs, the new models exhibit superior recognition performance. (C) 2000 Academic Press.
引用
收藏
页码:33 / 46
页数:14
相关论文
共 50 条
  • [21] CONTEXTUAL VECTOR QUANTIZATION FOR SPEECH RECOGNITION WITH DISCRETE HIDDEN MARKOV MODEL
    HUO, QA
    CHAN, CK
    PATTERN RECOGNITION, 1995, 28 (04) : 513 - 517
  • [22] Joint Tracking of Clean Speech and Noise Using HMMs and Particle Filters for Robust Speech Recognition
    Mushtaq, Aleem
    Lee, Chin-Hui
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1618 - 1622
  • [23] COMBINING MIXTURE WEIGHT PRUNING AND QUANTIZATION FOR SMALL-FOOTPRINT SPEECH RECOGNITION
    Huggins-Daines, David
    Rudnicky, Alexander I.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4189 - 4192
  • [24] CONTINUOUS SPEECH RECOGNITION USING A DEPENDENCY GRAMMAR AND PHONEME-BASED HMMS
    MATSUNAGA, S
    HOMMA, S
    SAGAYAMA, S
    FURUI, S
    IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1826 - 1833
  • [25] Use of Gaussian Selection in large vocabulary continuous speech recognition using HMMS
    Knill, KM
    Gales, MJF
    Young, SJ
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 470 - 473
  • [26] Getting bored with HTK? Using HMMs for emotion recognition from speech signals
    Pittermann, Angela
    Pittermann, Johannes
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 704 - +
  • [27] Noise Robust Speech Recognition Based on Noise-Adapted HMMs Using Speech Feature Compensation
    Chung, Yong-Joo
    2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE APPLICATIONS AND TECHNOLOGIES (ACSAT), 2014, : 132 - 135
  • [28] Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts
    Perez, Matthew
    Aldeneh, Zakaria
    Provost, Emily Mower
    INTERSPEECH 2020, 2020, : 4986 - 4990
  • [30] Using Discrete Tchebichef Transform on Speech Recognition
    Ernawan, Ferda
    Noersasongko, Edi
    Abu, Nur Azman
    FOURTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2011): COMPUTER VISION AND IMAGE ANALYSIS: PATTERN RECOGNITION AND BASIC TECHNOLOGIES, 2012, 8350