The contribution of dynamic visual cues to audiovisual speech perception

被引:11
|
作者
Jaekl, Philip [1 ,2 ]
Pesquita, Ana [3 ]
Alsius, Agnes [4 ]
Munhall, Kevin [4 ]
Soto-Faraco, Salvador [5 ,6 ]
机构
[1] Univ Rochester, Ctr Visual Sci, Rochester, NY 14627 USA
[2] Univ Rochester, Dept Brain & Cognit Sci, Rochester, NY USA
[3] Univ British Columbia, Dept Psychol, UBC Vis Lab, Vancouver, BC, Canada
[4] Queens Univ, Dept Psychol, Kingston, ON K7L 3N6, Canada
[5] Univ Pompeu Fabra, Dept Informat Technol & Commun, Ctr Brain & Cognit, Barcelona, Spain
[6] ICREA, Barcelona, Spain
基金
加拿大自然科学与工程研究理事会; 欧洲研究理事会;
关键词
Speech-in-noise; Visual form; Visual motion; Visual pathways; Biological motion; Configural; Audiovisual enhancement; BIOLOGICAL MOTION PERCEPTION; CRITICAL FEATURES; VISIBLE SPEECH; FROM-MOTION; FORM; FACES; RECOGNITION; INTEGRATION; ADAPTATION; MECHANISMS;
D O I
10.1016/j.neuropsychologia.2015.06.025
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Seeing a speaker's facial gestures can significantly improve speech comprehension, especially in noisy environments. However, the nature of the visual information from the speaker's facial movements that is relevant for this enhancement is still unclear. Like auditory speech signals, visual speech signals unfold over time and contain both dynamic configural information and luminance-defined local motion cues; two information sources that are thought to engage anatomically and functionally separate visual systems. Whereas, some past studies have highlighted the importance of local, luminance-defined motion cues in audiovisual speech perception, the contribution of dynamic configural information signalling changes in form over time has not yet been assessed. We therefore attempted to single out the contribution of dynamic configural information to audiovisual speech processing. To this aim, we measured word identification performance in noise using unimodal auditory stimuli, and with audiovisual stimuli. In the audiovisual condition, speaking faces were presented as point light displays achieved via motion capture of the original talker. Point light displays could be isoluminant, to minimise the contribution of effective luminance-defined local motion information, or with added luminance contrast, allowing the combined effect of dynamic configural cues and local motion cues. Audiovisual enhancement was found in both the isoluminant and contrast-based luminance conditions compared to an auditory-only condition, demonstrating, for the first time the specific contribution of dynamic configural cues to audiovisual speech improvement. These findings imply that globally processed changes in a speaker's facial shape contribute significantly towards the perception of articulatory gestures and the analysis of audiovisual speech. (C) 2015 Elsevier Ltd. All rights reserved.
引用
下载
收藏
页码:402 / 410
页数:9
相关论文
共 50 条
  • [21] Enhancement of Visual Perception with Use of Dynamic Cues
    Andia, Marcelo E.
    Plett, Johannes
    Tejos, Cristian
    Guarini, Marcelo W.
    Navarro, Maria E.
    Razmilic, Dravna
    Meneses, Luis
    Villalon, Manuel J.
    Irarrazaval, Pablo
    RADIOLOGY, 2009, 250 (02) : 551 - 557
  • [22] Audiovisual Speech Perception in Infancy: The Influence of Vowel Identity and Infants' Productive Abilities on Sensitivity to (Mis)Matches Between Auditory and Visual Speech Cues
    Altvater-Mackensen, Nicole
    Mani, Nivedita
    Grossmann, Tobias
    DEVELOPMENTAL PSYCHOLOGY, 2016, 52 (02) : 191 - 204
  • [23] High visual resolution matters in audiovisual speech perception, but only for some
    Alsius, Agnes
    Wayne, Rachel V.
    Pare, Martin
    Munhall, Kevin G.
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2016, 78 (05) : 1472 - 1487
  • [24] High visual resolution matters in audiovisual speech perception, but only for some
    Agnès Alsius
    Rachel V. Wayne
    Martin Paré
    Kevin G. Munhall
    Attention, Perception, & Psychophysics, 2016, 78 : 1472 - 1487
  • [25] Contributions of oral and extraoral facial movement to visual and audiovisual speech perception
    Thomas, SM
    Jordan, TR
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2004, 30 (05) : 873 - 888
  • [26] Integration of Facial and Newly Learned Visual Cues in Speech Perception
    Massaro, Dom
    Cohen, Michael M.
    Meyer, Heidi
    Stribling, Tracy
    Sterling, Cass
    Vanderhyden, Sam
    AMERICAN JOURNAL OF PSYCHOLOGY, 2011, 124 (03): : 341 - 354
  • [27] Maturation of audiovisual speech perception
    Tiippana, K
    Hayes, E
    Kraus, N
    Sams, M
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2002, : 42 - 43
  • [28] Visual perception of vowels from static and dynamic cues
    Rojczyk, Arkadiusz
    Ciszewski, Tomasz
    Szwoch, Grzegorz
    Czyzewski, Andrzej
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (05): : EL328 - EL332
  • [29] An assessment of behavioral dynamic information processing measures in audiovisual speech perception
    Altieri, Nicholas
    Townsend, James T.
    FRONTIERS IN PSYCHOLOGY, 2011, 2
  • [30] The phase of cortical oscillations determines the perceptual fate of visual cues in naturalistic audiovisual speech
    Theze, Raphael
    Giraud, Anne-Lise
    Megevand, Pierre
    SCIENCE ADVANCES, 2020, 6 (45)