An improved maximum model distance approach for HMM-based speech recognition systems

被引:11
|
作者
He, QH [1 ]
Kwong, S [1 ]
Man, KF [1 ]
Tang, KS [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1016/S0031-3203(99)00144-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an improved maximum model distance (IMMD) approach for HMM-based speech recognition systems based on our previous work [S. Kwong, Q.H. He, K.F. Man, K.S. Tang. A maximum model distance approach for HMM-based speech recognition, Pattern Recognition 31 (3) (1998) 219-229]. It defines a more realistic model distance definition for HMM training, and utilizes the limited training data in a more effective manner. Discriminative information contained in the training data was used to improve the performance of the recognizer. HMM parameter adjustment rules were induced in details. Theoretical and practical issues concerning this approach are also discussed and investigated in this paper. Both isolated word and continuous speech recognition experiments showed that a significant error reduction could be achieved by IMMD when compared with the maximum model distance (MMD) criterion and other training methods using the minimum classification error (MCE) and the maximum mutual information (MMI) approaches. (C) 2000 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1749 / 1758
页数:10
相关论文
共 50 条
  • [1] A maximum model distance approach for HMM-based speech recognition
    Kwong, S
    He, QH
    Man, KF
    Tang, KS
    [J]. PATTERN RECOGNITION, 1998, 31 (03) : 219 - 229
  • [2] An improved training algorithm in HMM-based speech recognition
    Li, GJ
    Huong, TY
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1057 - 1060
  • [3] Maximum likelihood linear transformations for HMM-based speech recognition
    Cambridge Univ Engineering Dep, Cambridge, United Kingdom
    [J]. Comput Speech Lang, 2 (75-98):
  • [4] Maximum likelihood linear transformations for HMM-based speech recognition
    Gales, MJF
    [J]. COMPUTER SPEECH AND LANGUAGE, 1998, 12 (02): : 75 - 98
  • [5] An HMM-based speech recognition IC
    Han, W
    Hon, KW
    Chan, CF
    Lee, T
    Choy, CS
    Pun, KP
    Ching, PC
    [J]. PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II: COMMUNICATIONS-MULTIMEDIA SYSTEMS & APPLICATIONS, 2003, : 744 - 747
  • [6] From Stochastic Speech Recognition to Understanding: An HMM-Based approach
    Boda, PP
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 57 - 64
  • [7] A novel approach to HMM-based speech recognition systems using particle swarm optimization
    Najkar, Negin
    Razzazi, Farbod
    Sameti, Hossein
    [J]. MATHEMATICAL AND COMPUTER MODELLING, 2010, 52 (11-12) : 1910 - 1920
  • [8] HMM-based threshold model approach for gesture recognition
    Microsoft Korea, Seoul, Korea, Republic of
    [J]. IEEE Trans Pattern Anal Mach Intell, 10 (961-973):
  • [9] Peripheral features for HMM-based speech recognition
    Fukuda, T
    Takigawa, M
    Nitta, T
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 129 - 132
  • [10] An HMM-based threshold model approach for gesture recognition
    Lee, HK
    Kim, JH
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (10) : 961 - 973