Modeling the Development of Audiovisual Cue Integration in Speech Perception

Cited by: 5
Authors
Getz, Laura M. [1]
Nordeen, Elke R. [1]
Vrabic, Sarah C. [1]
Toscano, Joseph C. [1]
Affiliations
[1] Villanova Univ, Dept Psychol, Villanova, PA 19085 USA
Source
BRAIN SCIENCES | 2017 / Vol. 7 / Issue 03
Keywords
speech perception; speech development; multimodal representations; audiovisual cues; statistical learning; mixture of Gaussians; cue weighting; VISUAL SPEECH; PHONETIC INFORMATION; INTERMODAL REPRESENTATION; ACOUSTIC CHARACTERISTICS; INFANT PERCEPTION; CROSS-LANGUAGE; CHILDREN; DISCRIMINATION; SENSITIVITY; SYNCHRONY;
DOI
10.3390/brainsci7030032
Chinese Library Classification number
Q189 [Neuroscience];
Subject classification code
071006;
Abstract
Adult speech perception is generally enhanced when information is provided from multiple modalities. In contrast, infants do not appear to benefit from combining auditory and visual speech information early in development. This is true despite the fact that both modalities are important to speech comprehension even at early stages of language acquisition. How then do listeners learn how to process auditory and visual information as part of a unified signal? In the auditory domain, statistical learning processes provide an excellent mechanism for acquiring phonological categories. Is this also true for the more complex problem of acquiring audiovisual correspondences, which require the learner to integrate information from multiple modalities? In this paper, we present simulations using Gaussian mixture models (GMMs) that learn cue weights and combine cues on the basis of their distributional statistics. First, we simulate the developmental process of acquiring phonological categories from auditory and visual cues, asking whether simple statistical learning approaches are sufficient for learning multi-modal representations. Second, we use this time course information to explain audiovisual speech perception in adult perceivers, including cases where auditory and visual input are mismatched. Overall, we find that domain-general statistical learning techniques allow us to model the developmental trajectory of audiovisual cue integration in speech, and in turn, allow us to better understand the mechanisms that give rise to unified percepts based on multiple cues.
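The cue-combination idea in the abstract can be illustrated with a minimal sketch. It is not the authors' simulation code: the category labels, the means and standard deviations, the helper names (`posterior`, `cue_reliability`), and the assumption that the auditory and visual cues are conditionally independent are all hypothetical choices made for illustration. The sketch only classifies with fixed Gaussian categories; it does not implement the learning of those categories over development.

```python
import math

def gaussian_pdf(x, mean, sd):
    """Density of a univariate Gaussian at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

# Hypothetical categories: (mean, sd) per cue, in arbitrary units.
# These numbers are illustrative assumptions, not values from the paper.
CATEGORIES = {
    "A": {"auditory": (0.0, 1.0), "visual": (0.0, 2.0)},
    "B": {"auditory": (4.0, 1.0), "visual": (4.0, 2.0)},
}

def posterior(cues, priors=None):
    """Posterior over categories given a dict of cue values.

    Assumes cues are conditionally independent given the category,
    so per-cue Gaussian likelihoods are multiplied together.
    """
    names = list(CATEGORIES)
    if priors is None:
        priors = {name: 1.0 / len(names) for name in names}
    joint = {}
    for name in names:
        value = priors[name]
        for cue, observed in cues.items():
            mean, sd = CATEGORIES[name][cue]
            value *= gaussian_pdf(observed, mean, sd)
        joint[name] = value
    total = sum(joint.values())
    return {name: value / total for name, value in joint.items()}

def cue_reliability(cue):
    """Separation of the two category means in pooled-sd units.

    One simple stand-in for a cue weight: a cue whose categories
    overlap less is more informative.
    """
    (m1, s1), (m2, s2) = (CATEGORIES[name][cue] for name in CATEGORIES)
    return abs(m1 - m2) / math.sqrt(0.5 * (s1 ** 2 + s2 ** 2))

if __name__ == "__main__":
    # Mismatched input: the auditory cue sits at category A's mean,
    # the visual cue at category B's mean.
    print(posterior({"auditory": 0.0, "visual": 4.0}))
    print(cue_reliability("auditory"), cue_reliability("visual"))
```

With these assumed parameters the auditory cue separates the categories more sharply than the visual cue, so the mismatched input is assigned to category A with a posterior of about 0.998. Making the visual distributions narrower than the auditory ones would shift the percept toward the visual category, which is the sense in which distributional statistics determine how much each cue counts.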
Pages: 23