Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models

被引：0

作者：

Nock, HJ ^{[1
]}

Young, SJ ^{[1
]}

机构：

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England

来源：

COGNITIVE SCIENCE | 2002年 / 26卷 / 03期

关键词：

automatic speech recognition; pronunciation modelling; loosely coupled hidden Markov models; variational approximation;

D O I：

暂无

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

Hidden Markov models (HMMs) have been successful for modelling the dynamics of carefully dictated speech, but their performance degrades severely when used to model conversational speech. Since speech is produced by a system of loosely coupled articulators, stochastic models explicitly representing this parallelism may have advantages for automatic speech recognition (ASR), particularly when trying to model the phonological effects inherent in casual spontaneous speech. This paper presents a preliminary feasibility study of one such model class: loosely coupled HMMs. Exact model estimation and decoding is potentially expensive, so possible approximate algorithms are also discussed. Comparison of one particular loosely coupled model on an isolated word task suggests loosely coupled HMMs merit further investigation. An approximate algorithm giving performance which is almost always statistically indistinguishable from the exact algorithm is also identified, making more extensive research computationally feasible. (C) 2002 Cognitive Science Society, Inc. All rights reserved.

引用

页码：283 / 301

页数：19

共 50 条

[21] On the robust incorporation of formant features into hidden Markov models for automatic speech recognition
Garner, PN
Holmes, WJ
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1 - 4
[22] The Application of Hidden Markov Models in Speech Recognition
Gales, Mark
Young, Steve
FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2007, 1 (03): : 195 - 304
[23] Noisy Hidden Markov Models for Speech Recognition
Audhkhasi, Kartik
Osoba, Osonde
Kosko, Bart
2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
[24] Hidden Markov models for speech and signal recognition
Rose, RC
Juang, BH
CONTINUOUS WAVE-FORM ANALYSIS, 1996, (45): : 137 - 152
[25] HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION
JUANG, BH
RABINER, LR
TECHNOMETRICS, 1991, 33 (03) : 251 - 272
[26] Visual speech recognition using Active Shape Models and Hidden Markov Models
Luettin, J
Thacker, NA
Beet, SW
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 817 - 820
[27] Visual speech recognition using motion features and hidden Markov models
Yau, Wai Chee
Kumar, Dinesh Kant
Weghorn, Hans
COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2007, 4673 : 832 - 839
[28] Telephone speech recognition using neural networks and hidden Markov models
Yuk, D
Flanagan, J
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 157 - 160
[29] Speech recognition on an FPA using discrete and continuous hidden Markov models
Melnikoff, SJ
Quigley, SF
Russell, MJ
FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS: RECONFIGURABLE COMPUTING IS GOING MAINSTREAM, 2002, 2438 : 202 - 211
[30] Stereophonic speech recognition in noise using compensated hidden Markov models
Brookes, DM
Leung, MH
ELECTRONICS LETTERS, 1998, 34 (19) : 1827 - 1829

← 1 2 3 4 5 →