USE OF MULTIPLE VECTOR QUANTIZATION FOR SEMICONTINUOUS-HMM SPEECH RECOGNITION

Cited by: 2

Authors:

PEINADO, AM

SEGURA, JC

RUBIO, AJ

SANCHEZ, VE

GARCIA, P

Affiliation:

[1] Universidad de Granada, Granada

Source:

IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING | 1994 / Vol. 141 / No. 06

Keywords:

ERROR RATE; HIDDEN MARKOV MODELS; PROBABILITY DENSITY FUNCTION; SPEECH MODELING; SPEECH RECOGNITION;

DOI:

10.1049/ip-vis:19941576

Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Although the continuous hidden Markov model (CHMM) technique seems to be the most flexible and complete tool for speech modelling, it is not always used to implement speech recognition systems because of several problems related to training and computational complexity. Thus, simpler types of HMMs, such as discrete (DHMM) or semicontinuous (SCHMM) models, are commonly utilised with very acceptable results. Moreover, the superiority of continuous models over these types of HMMs is not clear. The authors' group has recently introduced the multiple vector quantisation (MVQ) technique, the main feature of which is the use of one separate VQ codebook for each recognition unit. Applying the MVQ technique to DHMM models yields a new form of HMM modelling (basic MVQ models) that lets the recognition dynamics incorporate input-sequence information that the discrete models waste in the VQ process. The authors propose a new variant of HMM models that arises from the idea of applying MVQ to SCHMM models. These are SCMVQ-HMM (semicontinuous multiple vector quantisation HMM) models, which use one VQ codebook per recognition unit and several quantisation candidates for each input vector. It is shown that SCMVQ modelling is formally the closest to CHMM, while requiring even less computation than SCHMMs. After studying several implementation issues of the MVQ technique, such as which type of probability density function should be used, the authors show the superiority of SCMVQ models over other types of HMM models such as DHMMs, SCHMMs or the basic MVQs.
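
The abstract's core idea (one VQ codebook per recognition unit, with several quantisation candidates per input vector) can be illustrated with a minimal Python sketch. This is not the authors' formulation, which this record does not reproduce. It assumes the usual semicontinuous output probability, a sum over the best M codewords of the codeword density times the state's discrete weight, with diagonal-covariance Gaussian densities. The unit names, codebooks, weights and function names below are all hypothetical toy values.

```python
import math


def gaussian_density(x, mean, var):
    """Diagonal-covariance Gaussian density of vector x (assumed pdf type)."""
    log_p = 0.0
    for xi, mi, vi in zip(x, mean, var):
        log_p += -0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
    return math.exp(log_p)


def top_candidates(x, codebook, m):
    """Return (index, density) pairs for the m codewords with highest density."""
    scored = [(k, gaussian_density(x, mean, var))
              for k, (mean, var) in enumerate(codebook)]
    scored.sort(key=lambda kd: kd[1], reverse=True)
    return scored[:m]


def sc_output_prob(x, codebook, state_weights, m):
    """Semicontinuous output probability of one state:
    sum over the m candidates of density(x | codeword k) * weight of k."""
    return sum(d * state_weights[k] for k, d in top_candidates(x, codebook, m))


# Hypothetical toy setup: two recognition units, each with its OWN codebook
# of (mean, variance) codewords, and one state's discrete weights per unit.
codebooks = {
    "unit_a": [((0.0, 0.0), (1.0, 1.0)), ((1.0, 1.0), (1.0, 1.0))],
    "unit_b": [((5.0, 5.0), (1.0, 1.0)), ((6.0, 6.0), (1.0, 1.0))],
}
state_weights = {"unit_a": [0.7, 0.3], "unit_b": [0.5, 0.5]}

x = (0.2, -0.1)  # one input feature vector
scores = {u: sc_output_prob(x, codebooks[u], state_weights[u], m=2)
          for u in codebooks}
best_unit = max(scores, key=scores.get)
```

In a full recogniser these per-state probabilities would feed a Viterbi or forward pass over each unit's HMM. The sketch only shows how each unit scores the same input vector against its own codebook rather than a shared one.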


Pages: 391-396

Page count: 6

