Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models

被引:0
|
作者
Nock, HJ [1 ]
Young, SJ [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
关键词
automatic speech recognition; pronunciation modelling; loosely coupled hidden Markov models; variational approximation;
D O I
暂无
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Hidden Markov models (HMMs) have been successful for modelling the dynamics of carefully dictated speech, but their performance degrades severely when used to model conversational speech. Since speech is produced by a system of loosely coupled articulators, stochastic models explicitly representing this parallelism may have advantages for automatic speech recognition (ASR), particularly when trying to model the phonological effects inherent in casual spontaneous speech. This paper presents a preliminary feasibility study of one such model class: loosely coupled HMMs. Exact model estimation and decoding is potentially expensive, so possible approximate algorithms are also discussed. Comparison of one particular loosely coupled model on an isolated word task suggests loosely coupled HMMs merit further investigation. An approximate algorithm giving performance which is almost always statistically indistinguishable from the exact algorithm is also identified, making more extensive research computationally feasible. (C) 2002 Cognitive Science Society, Inc. All rights reserved.
引用
收藏
页码:283 / 301
页数:19
相关论文
共 50 条
  • [21] On the robust incorporation of formant features into hidden Markov models for automatic speech recognition
    Garner, PN
    Holmes, WJ
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1 - 4
  • [22] The Application of Hidden Markov Models in Speech Recognition
    Gales, Mark
    Young, Steve
    FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2007, 1 (03): : 195 - 304
  • [23] Noisy Hidden Markov Models for Speech Recognition
    Audhkhasi, Kartik
    Osoba, Osonde
    Kosko, Bart
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [24] Hidden Markov models for speech and signal recognition
    Rose, RC
    Juang, BH
    CONTINUOUS WAVE-FORM ANALYSIS, 1996, (45): : 137 - 152
  • [25] HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION
    JUANG, BH
    RABINER, LR
    TECHNOMETRICS, 1991, 33 (03) : 251 - 272
  • [26] Visual speech recognition using Active Shape Models and Hidden Markov Models
    Luettin, J
    Thacker, NA
    Beet, SW
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 817 - 820
  • [27] Visual speech recognition using motion features and hidden Markov models
    Yau, Wai Chee
    Kumar, Dinesh Kant
    Weghorn, Hans
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2007, 4673 : 832 - 839
  • [28] Telephone speech recognition using neural networks and hidden Markov models
    Yuk, D
    Flanagan, J
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 157 - 160
  • [29] Speech recognition on an FPA using discrete and continuous hidden Markov models
    Melnikoff, SJ
    Quigley, SF
    Russell, MJ
    FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS: RECONFIGURABLE COMPUTING IS GOING MAINSTREAM, 2002, 2438 : 202 - 211
  • [30] Stereophonic speech recognition in noise using compensated hidden Markov models
    Brookes, DM
    Leung, MH
    ELECTRONICS LETTERS, 1998, 34 (19) : 1827 - 1829