Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models

被引:0
|
作者
Nock, HJ [1 ]
Young, SJ [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
关键词
automatic speech recognition; pronunciation modelling; loosely coupled hidden Markov models; variational approximation;
D O I
暂无
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Hidden Markov models (HMMs) have been successful for modelling the dynamics of carefully dictated speech, but their performance degrades severely when used to model conversational speech. Since speech is produced by a system of loosely coupled articulators, stochastic models explicitly representing this parallelism may have advantages for automatic speech recognition (ASR), particularly when trying to model the phonological effects inherent in casual spontaneous speech. This paper presents a preliminary feasibility study of one such model class: loosely coupled HMMs. Exact model estimation and decoding is potentially expensive, so possible approximate algorithms are also discussed. Comparison of one particular loosely coupled model on an isolated word task suggests loosely coupled HMMs merit further investigation. An approximate algorithm giving performance which is almost always statistically indistinguishable from the exact algorithm is also identified, making more extensive research computationally feasible. (C) 2002 Cognitive Science Society, Inc. All rights reserved.
引用
收藏
页码:283 / 301
页数:19
相关论文
共 50 条
  • [41] Fuzzy hidden Markov models for speech and speaker recognition
    Tran, D
    Wagner, M
    18TH INTERNATIONAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS, 1999, : 426 - 430
  • [42] Factor analysed hidden Markov models for speech recognition
    Rosti, AVI
    Gales, MJF
    COMPUTER SPEECH AND LANGUAGE, 2004, 18 (02): : 181 - 200
  • [43] BAYESIAN SENSING HIDDEN MARKOV MODELS FOR SPEECH RECOGNITION
    Saon, George
    Chien, Jen-Tzung
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5056 - 5059
  • [44] Fuzzy hidden Markov models for speech and speaker recognition
    Tran, Dat
    Wagner, Michael
    Annual Conference of the North American Fuzzy Information Processing Society - NAFIPS, 1999, : 426 - 430
  • [45] IMPROVED HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION
    AUBERT, X
    BOURLARD, H
    KAMP, Y
    WELLEKENS, CJ
    PHILIPS JOURNAL OF RESEARCH, 1988, 43 (3-4) : 224 - 245
  • [46] REVISITING HIDDEN MARKOV MODELS FOR SPEECH EMOTION RECOGNITION
    Mao, Shuiyang
    Tao, Dehua
    Zhang, Guangyan
    Ching, P. C.
    Lee, Tan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6715 - 6719
  • [47] Automatic Phoneme Recognition with Segmental Hidden Markov Models
    Baghdasaryan, Areg G.
    Beex, A. A.
    2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 569 - 574
  • [48] Coupled hidden Markov models for biosignal interaction modelling
    Rezek, Iead
    Sykacek, Peter
    Roberts, Stephen J.
    IEE Conference Publication, 2000, (476): : 54 - 59
  • [49] Audio-visual speech fusion using coupled hidden Markov models
    Chu, Stephen M.
    Huang, Thomas S.
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3911 - +
  • [50] Audio-visual speech modeling using coupled hidden Markov models
    Chu, SM
    Huang, TS
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2009 - 2012