The Complementary Roles of Auditory and Motor Information Evaluated in a Bayesian Perceptuo-Motor Model of Speech Perception

被引:14
|
作者
Laurent, Raphael [1 ,2 ]
Barnaud, Marie-Lou [1 ,2 ]
Schwartz, Jean-Luc [1 ,2 ]
Bessiere, Pierre [3 ,4 ]
Diard, Julien [2 ,5 ]
机构
[1] Univ Grenoble Alpes, GIPSA Lab, Grenoble, France
[2] CNRS, Grenoble, France
[3] Univ Paris 06, Sorbonne Univ, Inst Syst Intelligents & Robot, Paris, France
[4] CNRS, Paris, France
[5] Univ Grenoble Alpes, Lab Psychol & Neurocognit, Grenoble, France
基金
欧洲研究理事会;
关键词
speech perception; computational modeling; sensory-motor interactions; adverse conditions; plosive invariance; LANGUAGE-ACQUISITION; AUDIOVISUAL SPEECH; FUNCTIONAL-ORGANIZATION; COMPENSATION STRATEGIES; GRASP REPRESENTATIONS; ARTICULATORY FEATURES; CATEGORIES INSIGHTS; LOCUS EQUATIONS; PREMOTOR CORTEX; NATIVE-LANGUAGE;
D O I
10.1037/rev0000069
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
There is a consensus concerning the view that both auditory and motor representations intervene in the perceptual processing of speech units. However, the question of the functional role of each of these systems remains seldom addressed and poorly understood. We capitalized on the formal framework of Bayesian Programming to develop COSMO (Communicating Objects using SensoryMotor Operations), an integrative model that allows principled comparisons of purely motor or purely auditory implementations of a speech perception task and tests the gain of efficiency provided by their Bayesian fusion. Here, we show 3 main results: (a) In a set of precisely defined "perfect conditions,"auditory and motor theories of speech perception are indistinguishable; (b) When a learning process that mimics speech development is introduced into COSMO, it departs from these perfect conditions. Then auditory recognition becomes more efficient than motor recognition in dealing with learned stimuli, while motor recognition is more efficient in adverse conditions. We interpret this result as a general "auditory-narrowband versus motor-wideband" property; and (c) Simulations of plosive-vowel syllable recognition reveal possible cues from motor recognition for the invariant specification of the place of plosive articulation in context that are lacking in the auditory pathway. This provides COSMO with a second property, where auditory cues would be more efficient for vowel decoding and motor cues for plosive articulation decoding. These simulations provide several predictions, which are in good agreement with experimental data and suggest that there is natural complementarity between auditory and motor processing within a perceptuomotor theory of speech perception.
引用
收藏
页码:572 / 602
页数:31
相关论文
共 38 条