Genones: Generalized mixture tying in continuous hidden Markov model-based speech recognizers

被引:62
|
作者
Digalakis, VV [1 ]
Monaco, P [1 ]
Murveit, H [1 ]
机构
[1] SRI INT, SPEECH TECHNOL & RES LAB, MENLO PK, CA 94025 USA
来源
关键词
D O I
10.1109/89.506931
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An algorithm is proposed that achieves a good tradeoff between modeling resolution and robustness by using a new, general scheme for tying of mixture components in continuous mixture-density hidden Markov model (HMM)-based speech recognizers. The sets of HMM states that share the same mixture components are determined automatically using agglomerative clustering techniques. Experimental results on ARPA's Wall Street Journal corpus show that this scheme reduces errors by 25% over typical tied-mixture systems. New fast algorithms for computing Gaussian likelihoods-the most time-consuming aspect of continuous-density HMM systems-are also presented. These new algorithms significantly reduce the number of Gaussian densities that are evaluated with little or no impact on speech recognition accuracy.
引用
收藏
页码:281 / 289
页数:9
相关论文
共 50 条
  • [1] Genones: generalized mixture tying in continuous hidden Markov model-based speech recognizers
    Technical Univ of Crete, Chania, Greece
    [J]. IEEE Trans Speech Audio Process, 4 (281-289):
  • [2] STATE CLUSTERING IN HIDDEN MARKOV MODEL-BASED CONTINUOUS SPEECH RECOGNITION
    YOUNG, SJ
    WOODLAND, PC
    [J]. COMPUTER SPEECH AND LANGUAGE, 1994, 8 (04): : 369 - 383
  • [3] Deterministically annealed design of hidden Markov model speech recognizers
    Rao, AV
    Rose, K
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (02): : 111 - 126
  • [4] Hidden Markov model-based speech emotion recognition
    Schuller, B
    Rigoll, G
    Lang, M
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 1 - 4
  • [5] Hidden Markov model-based speech emotion recognition
    Schuller, B
    Rigoll, G
    Lang, M
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 401 - 404
  • [6] KEYWORD DETECTION IN CONVERSATIONAL SPEECH UTTERANCES USING HIDDEN MARKOV MODEL-BASED CONTINUOUS SPEECH RECOGNITION
    ROSE, RC
    [J]. COMPUTER SPEECH AND LANGUAGE, 1995, 9 (04): : 309 - 333
  • [7] Rayleigh mixture model-based hidden Markov modeling and estimation of noise in noisy speech signals
    Sorensen, Karsten Vandborg
    Andersen, Soren Vang
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 901 - 917
  • [8] A hidden semi-Markov model-based speech synthesis system
    Zen, Heiga
    Tokuda, Keiichi
    Masuko, Takashi
    Kobayasih, Takao
    Kitamura, Tadashi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (05) : 825 - 834
  • [9] STATE CLUSTERING IN HIDDEN MARKOV MODEL-BASED CONTINUOUS SPEECH RECOGNITION, (VOL 8, PG 369, 1994)
    YOUNG, SJ
    WOODLAND, PC
    [J]. COMPUTER SPEECH AND LANGUAGE, 1995, 9 (01): : 105 - 105
  • [10] Hidden Markov model-based speech recognition with intermediate wavelet transform domains
    Singh, R
    Davis, K
    Rao, PVS
    [J]. COMPUTER SPEECH AND LANGUAGE, 1997, 11 (03): : 253 - 273