EXPLOITING SPARSITY IN STRANDED HIDDEN MARKOV MODELS FOR AUTOMATIC SPEECH RECOGNITION

被引:0
|
作者
Zhao, Yong [1 ]
Juang, Biing-Hwang [1 ]
机构
[1] Georgia Inst Technol, Dept Elect & Comp Engn, Atlanta, GA 30332 USA
关键词
Speech recognition; hidden Markov model; MIXTURE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We have recently proposed the stranded HMM to achieve a more accurate representation of heterogeneous data. As opposed to the regular Gaussian mixture HMM, the stranded HMM explicitly models the relationships among the mixture components. The transitions among mixture components encode possible trajectories of acoustic features for speech units. Accurately representing the underlying transition structure is crucial for the stranded HMM to produce an optimal recognition performance. In this paper, we propose to learn the stranded HMM structure by imposing sparsity constraints. In particular, entropic priors are incorporated in the maximum a posteriori (MAP) estimation of the mixture transition matrices. The experimental results showed that a significant improvement in model sparsity can be obtained with a slight sacrifice of the recognition accuracy.
引用
收藏
页码:1623 / 1625
页数:3
相关论文
共 50 条
  • [1] Automatic speech recognition using hidden Markov models
    Botros, N.M.
    Teh, C.K.
    [J]. Microcomputer Applications, 1994, 13 (01): : 6 - 12
  • [2] STRANDED GAUSSIAN MIXTURE HIDDEN MARKOV MODELS FOR ROBUST SPEECH RECOGNITION
    Zhao, Yong
    Juang, Biing-Hwang
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4301 - 4304
  • [3] HIDDEN MARKOV-MODELS FOR AUTOMATIC SPEECH RECOGNITION - THEORY AND APPLICATION
    COX, SJ
    [J]. BRITISH TELECOM TECHNOLOGY JOURNAL, 1988, 6 (02): : 105 - 115
  • [4] HIDDEN MARKOV MODELS IN SPEECH RECOGNITION
    Krajcovic, J.
    Hrncar, M.
    Muzikarova, E.
    [J]. ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2008, 7 (1-2) : 250 - 252
  • [5] TEMPORAL CONTROL IMPROVEMENT OF HIDDEN MARKOV-MODELS FOR AUTOMATIC SPEECH RECOGNITION
    DOURSSENAC, C
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1991, 32 (1-5): : 549 - 556
  • [6] AUTOMATIC RECOGNITION OF KEYWORDS IN UNCONSTRAINED SPEECH USING HIDDEN MARKOV-MODELS
    WILPON, JG
    RABINER, LR
    LEE, CH
    GOLDMAN, ER
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (11): : 1870 - 1878
  • [7] AUTOMATIC SPEECH RECOGNITION USING TIED DENSITY HIDDEN MARKOV-MODELS
    EULER, S
    [J]. FREQUENZ, 1992, 46 (11-12) : 274 - 279
  • [8] CONTINUOUSLY VARIABLE DURATION HIDDEN MARKOV MODELS FOR AUTOMATIC SPEECH RECOGNITION.
    Levinson, S.E.
    [J]. Computer Speech and Language, 1986, 1 (01): : 29 - 45
  • [9] Hidden Markov models merging acoustic and articulatory information to automatic speech recognition
    Jacob, B
    Senac, C
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2313 - 2315
  • [10] On the robust incorporation of formant features into hidden Markov models for automatic speech recognition
    Garner, PN
    Holmes, WJ
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1 - 4