Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models

被引：0

作者：

Nock, HJ ^{[1
]}

Young, SJ ^{[1
]}

机构：

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England

来源：

COGNITIVE SCIENCE | 2002年 / 26卷 / 03期

关键词：

automatic speech recognition; pronunciation modelling; loosely coupled hidden Markov models; variational approximation;

D O I：

暂无

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

Hidden Markov models (HMMs) have been successful for modelling the dynamics of carefully dictated speech, but their performance degrades severely when used to model conversational speech. Since speech is produced by a system of loosely coupled articulators, stochastic models explicitly representing this parallelism may have advantages for automatic speech recognition (ASR), particularly when trying to model the phonological effects inherent in casual spontaneous speech. This paper presents a preliminary feasibility study of one such model class: loosely coupled HMMs. Exact model estimation and decoding is potentially expensive, so possible approximate algorithms are also discussed. Comparison of one particular loosely coupled model on an isolated word task suggests loosely coupled HMMs merit further investigation. An approximate algorithm giving performance which is almost always statistically indistinguishable from the exact algorithm is also identified, making more extensive research computationally feasible. (C) 2002 Cognitive Science Society, Inc. All rights reserved.

引用

页码：283 / 301

页数：19

共 50 条

[41] Fuzzy hidden Markov models for speech and speaker recognition
Tran, D
Wagner, M
18TH INTERNATIONAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS, 1999, : 426 - 430
[42] Factor analysed hidden Markov models for speech recognition
Rosti, AVI
Gales, MJF
COMPUTER SPEECH AND LANGUAGE, 2004, 18 (02): : 181 - 200
[43] BAYESIAN SENSING HIDDEN MARKOV MODELS FOR SPEECH RECOGNITION
Saon, George
Chien, Jen-Tzung
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5056 - 5059
[44] Fuzzy hidden Markov models for speech and speaker recognition
Tran, Dat
Wagner, Michael
Annual Conference of the North American Fuzzy Information Processing Society - NAFIPS, 1999, : 426 - 430
[45] IMPROVED HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION
AUBERT, X
BOURLARD, H
KAMP, Y
WELLEKENS, CJ
PHILIPS JOURNAL OF RESEARCH, 1988, 43 (3-4) : 224 - 245
[46] REVISITING HIDDEN MARKOV MODELS FOR SPEECH EMOTION RECOGNITION
Mao, Shuiyang
Tao, Dehua
Zhang, Guangyan
Ching, P. C.
Lee, Tan
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6715 - 6719
[47] Automatic Phoneme Recognition with Segmental Hidden Markov Models
Baghdasaryan, Areg G.
Beex, A. A.
2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 569 - 574
[48] Coupled hidden Markov models for biosignal interaction modelling
Rezek, Iead
Sykacek, Peter
Roberts, Stephen J.
IEE Conference Publication, 2000, (476): : 54 - 59
[49] Audio-visual speech fusion using coupled hidden Markov models
Chu, Stephen M.
Huang, Thomas S.
2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3911 - +
[50] Audio-visual speech modeling using coupled hidden Markov models
Chu, SM
Huang, TS
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2009 - 2012

← 1 2 3 4 5 →