Prediction, Bayesian inference and feedback in speech recognition

Cited by: 82

Authors:

Norris, Dennis [1]

McQueen, James M. [2,3]

Cutler, Anne [3,4]

Affiliations:

[1] MRC, Cognit & Brain Sci Unit, Cambridge, England

[2] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, NL-6525 ED Nijmegen, Netherlands

[3] Max Planck Inst Psycholinguist, Nijmegen, Netherlands

[4] Univ Western Sydney, MARCS Inst, Penrith, NSW 2751, Australia

Source:

LANGUAGE COGNITION AND NEUROSCIENCE | 2016 / Vol. 31 / No. 01

Keywords:

Speech recognition; Bayesian inference; feedback; prediction; TOP-DOWN INFLUENCES; AUDITORY WORD RECOGNITION; SPOKEN-LANGUAGE; PHONETIC CATEGORIZATION; INTERACTIVE ACTIVATION; CORTICAL ORGANIZATION; NEURAL-NETWORKS; REACTION-TIME; PERCEPTION; MODEL;

DOI:

10.1080/23273798.2015.1081703

Chinese Library Classification (CLC):

R36 [Pathology]; R76 [Otorhinolaryngology];

Discipline classification codes:

100104 ; 100213 ;

Abstract:

Speech perception involves prediction, but how is that prediction implemented? In cognitive models prediction has often been taken to imply that there is feedback of activation from lexical to pre-lexical processes as implemented in interactive-activation models (IAMs). We show that simple activation feedback does not actually improve speech recognition. However, other forms of feedback can be beneficial. In particular, feedback can enable the listener to adapt to changing input, and can potentially help the listener to recognise unusual input, or recognise speech in the presence of competing sounds. The common feature of these helpful forms of feedback is that they are all ways of optimising the performance of speech recognition using Bayesian inference. That is, listeners make predictions about speech because speech recognition is optimal in the sense captured in Bayesian models.


Pages: 4-18

Page count: 15
