USE OF MULTIPLE VECTOR QUANTIZATION FOR SEMICONTINUOUS-HMM SPEECH RECOGNITION

被引:2
|
作者
PEINADO, AM
SEGURA, JC
RUBIO, AJ
SANCHEZ, VE
GARCIA, P
机构
[1] Universidad de Granada, Granada
来源
关键词
ERROR RATE; HIDDEN MARKOV MODELS; PROBABILITY DENSITY FUNCTION; SPEECH MODELING; SPEECH RECOGNITION;
D O I
10.1049/ip-vis:19941576
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Although the continuous hidden Markov model (CHMM) technique seems to be the most flexible and complete tool for speech modelling, it is not always used for the implementation of speech recognition systems because of several problems related to training and computational complexity. Thus, other simpler types of HMMs, such as discrete (DHMM) or semicontinuous (SCHMM) models, are commonly utilised with very acceptable results. Also, the superiority of continuous models over these types of HMMs is not clear. The authors' group has recently introduced the multiple vector quantisation (MVQ) technique, the main feature of which is the use of one separated VQ codebook for each recognition unit. The MVQ technique applied to DHMM models generates a new HMM modelling (basic MVQ models) that allows incorporation into the recognition dynamics of the input sequence information wasted by the discrete models in the VQ process. The authors propose a new variant of HMM models that arises from the idea of applying MVQ to SCHMM models. These are SCMVQ-HMM (semicontinuous multiple vector quantisation HMM) models that use one VQ codebook per recognition unit and several quantisation candidates for each input vector. It is shown that SCMVQ modelling is formally the closest one to CHMM, although requiring even less computation than SCHMMs. After studying several implementation issues of the MVQ technique, such as which type of probability density function should be used, the authors show the superiority of SCMVQ models over other types of HMM models such as DHMMs, SCHMMs or the basic MVQs.
引用
收藏
页码:391 / 396
页数:6
相关论文
共 50 条
  • [1] Optimum HMM combined with vector quantization for hindi speech word recognition
    Bansal, Poonam
    Dev, Amita
    Jain, Shail Bala
    [J]. IETE JOURNAL OF RESEARCH, 2008, 54 (04) : 239 - 243
  • [2] Distributed TDNN-Fuzzy Vector Quantization For HMM Speech Recognition
    Debyeche, Mohamed
    Amrouche, Aderrahmane.
    Haton, Jean Paul
    [J]. 2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS 2009), 2009, : 72 - +
  • [3] A New Hybrid Algorithm for Speech Recognition Based on HMM Segmentation and Learning Vector Quantization
    Katagiri, Shigeru
    Lee, Chin-Hui
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (04): : 421 - 430
  • [4] Multi-rate HMM quantization for speech recognition
    Vasilache, Marcel
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4341 - 4344
  • [5] Discriminative codebook design using multiple vector quantization in HMM-based speech recognizers
    Peinado, AM
    Segura, JC
    Rubio, AJ
    Garcia, P
    Perez, JL
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (02): : 89 - 95
  • [6] Phoneme-based vector quantization in a discrete HMM speech recognizer
    Zhang, YX
    Togneri, R
    Alder, M
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (01): : 26 - 32
  • [7] Matrix quantization with vector quantization error compensation for robust speech recognition
    Cong, L
    Asghar, S
    [J]. 1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 131 - 136
  • [8] Mixture of Support Vector Machines for HMM based speech recognition
    Krueger, Sven E.
    Schaffoener, Martin
    Katz, Marcel
    Andelic, Edin
    Wendemuth, Andreas
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 326 - +
  • [9] Trellis encoded vector quantization for robust speech recognition
    Chou, W
    Seshadri, N
    Rahim, M
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2001 - 2004
  • [10] Kernel based clustering and vector quantization for speech recognition
    Satish, DS
    Sekhar, CC
    [J]. MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 315 - 324