Prediction, Bayesian inference and feedback in speech recognition

被引:82
|
作者
Norris, Dennis [1 ]
McQueen, James M. [2 ,3 ]
Cutler, Anne [3 ,4 ]
机构
[1] MRC, Cognit & Brain Sci Unit, Cambridge, England
[2] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, NL-6525 ED Nijmegen, Netherlands
[3] Max Planck Inst Psycholinguist, Nijmegen, Netherlands
[4] Univ Western Sydney, MARCS Inst, Penrith, NSW 2751, Australia
关键词
Speech recognition; Bayesian inference; feedback; prediction; TOP-DOWN INFLUENCES; AUDITORY WORD RECOGNITION; SPOKEN-LANGUAGE; PHONETIC CATEGORIZATION; INTERACTIVE ACTIVATION; CORTICAL ORGANIZATION; NEURAL-NETWORKS; REACTION-TIME; PERCEPTION; MODEL;
D O I
10.1080/23273798.2015.1081703
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Speech perception involves prediction, but how is that prediction implemented? In cognitive models prediction has often been taken to imply that there is feedback of activation from lexical to pre-lexical processes as implemented in interactive-activation models (IAMs). We show that simple activation feedback does not actually improve speech recognition. However, other forms of feedback can be beneficial. In particular, feedback can enable the listener to adapt to changing input, and can potentially help the listener to recognise unusual input, or recognise speech in the presence of competing sounds. The common feature of these helpful forms of feedback is that they are all ways of optimising the performance of speech recognition using Bayesian inference. That is, listeners make predictions about speech because speech recognition is optimal in the sense captured in Bayesian models.
引用
收藏
页码:4 / 18
页数:15
相关论文
共 50 条
  • [1] Bayesian network structures and inference techniques for automatic speech recognition
    Zweig, G
    [J]. COMPUTER SPEECH AND LANGUAGE, 2003, 17 (2-3): : 173 - 193
  • [2] Robust speech recognition based on a Bayesian prediction approach
    Jiang, H
    Hirose, K
    Huo, Q
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (04): : 426 - 440
  • [3] Composite decision by Bayesian inference in distant-talking speech recognition
    Ji, Mikyong
    Kim, Sungtak
    Kim, Hoirin
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 463 - 470
  • [4] A Bayesian prediction approach to robust speech recognition and online environmental learning
    Chien, JT
    [J]. SPEECH COMMUNICATION, 2002, 37 (3-4) : 321 - 334
  • [5] From Birdsong to Human Speech Recognition: Bayesian Inference on a Hierarchy of Nonlinear Dynamical Systems
    Yildiz, Izzet B.
    von Kriegstein, Katharina
    Kiebel, Stefan J.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (09)
  • [6] Bayesian probabilistic inference for target recognition
    Chang, KC
    Liu, J
    Zhou, J
    [J]. SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION V, 1996, 2755 : 158 - 165
  • [7] A Bayesian inference model for speech localization (L)
    Escolano, Jose
    Perez-Lorenzo, Jose M.
    Xiang, Ning
    Cobos, Maximo
    Lopez, Jose J.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (03): : 1257 - 1260
  • [8] BAYESIAN DISCRIMINATIVE ADAPTATION FOR SPEECH RECOGNITION
    Raut, C. K.
    Gales, M. J. F.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4361 - 4364
  • [9] Speech recognition with dynamic Bayesian networks
    Zweig, G
    Russell, S
    [J]. FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 173 - 180
  • [10] Bayesian inference in mixtures-of-experts and hierarchical mixtures-of-experts models with an application to speech recognition
    Peng, FC
    Jacobs, RA
    Tanner, MA
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (435) : 953 - 960