Modeling the Development of Audiovisual Cue Integration in Speech Perception

Cited by: 5
Authors
Getz, Laura M. [1]
Nordeen, Elke R. [1]
Vrabic, Sarah C. [1]
Toscano, Joseph C. [1]
Affiliations
[1] Villanova Univ, Dept Psychol, Villanova, PA 19085 USA
Source
BRAIN SCIENCES | 2017 / Vol. 7 / Issue 03
Keywords
speech perception; speech development; multimodal representations; audiovisual cues; statistical learning; mixture of Gaussians; cue weighting; VISUAL SPEECH; PHONETIC INFORMATION; INTERMODAL REPRESENTATION; ACOUSTIC CHARACTERISTICS; INFANT PERCEPTION; CROSS-LANGUAGE; CHILDREN; DISCRIMINATION; SENSITIVITY; SYNCHRONY;
DOI
10.3390/brainsci7030032
Chinese Library Classification number
Q189 [Neuroscience];
Discipline classification code
071006;
Abstract
Adult speech perception is generally enhanced when information is provided from multiple modalities. In contrast, infants do not appear to benefit from combining auditory and visual speech information early in development. This is true despite the fact that both modalities are important to speech comprehension even at early stages of language acquisition. How then do listeners learn how to process auditory and visual information as part of a unified signal? In the auditory domain, statistical learning processes provide an excellent mechanism for acquiring phonological categories. Is this also true for the more complex problem of acquiring audiovisual correspondences, which require the learner to integrate information from multiple modalities? In this paper, we present simulations using Gaussian mixture models (GMMs) that learn cue weights and combine cues on the basis of their distributional statistics. First, we simulate the developmental process of acquiring phonological categories from auditory and visual cues, asking whether simple statistical learning approaches are sufficient for learning multi-modal representations. Second, we use this time course information to explain audiovisual speech perception in adult perceivers, including cases where auditory and visual input are mismatched. Overall, we find that domain-general statistical learning techniques allow us to model the developmental trajectory of audiovisual cue integration in speech, and in turn, allow us to better understand the mechanisms that give rise to unified percepts based on multiple cues.
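To make the idea of learning cue weights from distributional statistics concrete, the following is a minimal sketch and not the authors' implementation. It fits a two-component, one-dimensional Gaussian mixture to each cue with plain EM, then derives a relative weight for each cue from how well separated its two learned categories are. The synthetic cue values, the helper names (`fit_gmm_1d`, `cue_reliability`), the quartile initialisation, and the separation-based reliability metric are all assumptions introduced for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def fit_gmm_1d(x, n_iter=200):
    """Fit a two-component 1-D Gaussian mixture with plain EM (illustrative only)."""
    x = np.asarray(x, dtype=float)
    # Assumed initialisation: means at the quartiles, shared spread, equal priors.
    mu = np.percentile(x, [25, 75])
    sigma = np.full(2, x.std())
    phi = np.full(2, 0.5)
    for _ in range(n_iter):
        # E-step: responsibility of each component for each observation.
        z = (x[:, None] - mu) / sigma
        dens = phi * np.exp(-0.5 * z ** 2) / (sigma * np.sqrt(2.0 * np.pi))
        resp = dens / dens.sum(axis=1, keepdims=True)
        # M-step: re-estimate priors, means, and standard deviations.
        nk = resp.sum(axis=0)
        phi = nk / x.size
        mu = (resp * x[:, None]).sum(axis=0) / nk
        sigma = np.sqrt((resp * (x[:, None] - mu) ** 2).sum(axis=0) / nk)
    return phi, mu, sigma

def cue_reliability(mu, sigma):
    """Hypothetical reliability metric: category separation in pooled-SD units."""
    return abs(mu[1] - mu[0]) / np.sqrt(np.mean(sigma ** 2))

# Synthetic data (assumed values): a well-separated auditory cue and a
# more overlapping visual cue, each drawn from two phonological categories.
auditory = np.concatenate([rng.normal(0, 10, 1000), rng.normal(50, 10, 1000)])
visual = np.concatenate([rng.normal(0, 1, 1000), rng.normal(2, 1, 1000)])

phi_a, mu_a, sigma_a = fit_gmm_1d(auditory)
phi_v, mu_v, sigma_v = fit_gmm_1d(visual)

# Normalise the two reliabilities so the cue weights sum to one.
r_aud = cue_reliability(mu_a, sigma_a)
r_vis = cue_reliability(mu_v, sigma_v)
w_aud = r_aud / (r_aud + r_vis)
w_vis = r_vis / (r_aud + r_vis)

print("auditory weight:", round(w_aud, 3))
print("visual weight:", round(w_vis, 3))
```

Under these assumed distributions the auditory cue receives the larger weight because its categories overlap less. A learner exposed to different statistics would arrive at different weights, which is the sense in which the weighting is learned rather than fixed.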
Pages: 23