The contribution of dynamic visual cues to audiovisual speech perception

被引:11
|
作者
Jaekl, Philip [1 ,2 ]
Pesquita, Ana [3 ]
Alsius, Agnes [4 ]
Munhall, Kevin [4 ]
Soto-Faraco, Salvador [5 ,6 ]
机构
[1] Univ Rochester, Ctr Visual Sci, Rochester, NY 14627 USA
[2] Univ Rochester, Dept Brain & Cognit Sci, Rochester, NY USA
[3] Univ British Columbia, Dept Psychol, UBC Vis Lab, Vancouver, BC, Canada
[4] Queens Univ, Dept Psychol, Kingston, ON K7L 3N6, Canada
[5] Univ Pompeu Fabra, Dept Informat Technol & Commun, Ctr Brain & Cognit, Barcelona, Spain
[6] ICREA, Barcelona, Spain
基金
加拿大自然科学与工程研究理事会; 欧洲研究理事会;
关键词
Speech-in-noise; Visual form; Visual motion; Visual pathways; Biological motion; Configural; Audiovisual enhancement; BIOLOGICAL MOTION PERCEPTION; CRITICAL FEATURES; VISIBLE SPEECH; FROM-MOTION; FORM; FACES; RECOGNITION; INTEGRATION; ADAPTATION; MECHANISMS;
D O I
10.1016/j.neuropsychologia.2015.06.025
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Seeing a speaker's facial gestures can significantly improve speech comprehension, especially in noisy environments. However, the nature of the visual information from the speaker's facial movements that is relevant for this enhancement is still unclear. Like auditory speech signals, visual speech signals unfold over time and contain both dynamic configural information and luminance-defined local motion cues; two information sources that are thought to engage anatomically and functionally separate visual systems. Whereas, some past studies have highlighted the importance of local, luminance-defined motion cues in audiovisual speech perception, the contribution of dynamic configural information signalling changes in form over time has not yet been assessed. We therefore attempted to single out the contribution of dynamic configural information to audiovisual speech processing. To this aim, we measured word identification performance in noise using unimodal auditory stimuli, and with audiovisual stimuli. In the audiovisual condition, speaking faces were presented as point light displays achieved via motion capture of the original talker. Point light displays could be isoluminant, to minimise the contribution of effective luminance-defined local motion information, or with added luminance contrast, allowing the combined effect of dynamic configural cues and local motion cues. Audiovisual enhancement was found in both the isoluminant and contrast-based luminance conditions compared to an auditory-only condition, demonstrating, for the first time the specific contribution of dynamic configural cues to audiovisual speech improvement. These findings imply that globally processed changes in a speaker's facial shape contribute significantly towards the perception of articulatory gestures and the analysis of audiovisual speech. (C) 2015 Elsevier Ltd. All rights reserved.
引用
下载
收藏
页码:402 / 410
页数:9
相关论文
共 50 条
  • [1] ON THE ROLE OF VISUAL CUES IN AUDIOVISUAL SPEECH ENHANCEMENT
    Aldeneh, Zakaria
    Kumar, Anushree Prasanna
    Theobald, Barry-John
    Marchi, Erik
    Kajarekar, Sachin
    Naik, Devang
    Abdelaziz, Ahmed Hussen
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 8423 - 8427
  • [2] Crossmodal and incremental perception of audiovisual cues to emotional speech
    Barkhuysen, Pashiera
    Krahmer, Emiel
    Swerts, Marc
    LANGUAGE AND SPEECH, 2010, 53 : 3 - 30
  • [3] Visual attention modulates audiovisual speech perception
    Tiippana, K
    Andersen, TS
    Sams, M
    EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY, 2004, 16 (03): : 457 - 472
  • [4] Audiovisual Mandarin Lexical Tone Perception in Quiet and Noisy Contexts: The Influence of Visual Cues and Speech Rate
    Li, Manhong
    Chen, Xiaoxiang
    Zhu, Jiaqiang
    Chen, Fei
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2022, 65 (11): : 4385 - 4403
  • [5] Modulation of perception by visual, auditory and audiovisual reward predicting cues
    Antono, Jessica Emily
    Pooresmaeili, Arezoo
    PERCEPTION, 2022, 51 : 54 - 54
  • [6] COMPARATIVE ANALYSIS OF AUDIOVISUAL, AUDITIVE AND VISUAL PERCEPTION OF SPEECH
    EWERTSEN, HW
    NIELSEN, HB
    ACTA OTO-LARYNGOLOGICA, 1971, 72 (03) : 201 - &
  • [7] The role of visual spatial attention in audiovisual speech perception
    Andersen, Tobias S.
    Tiippana, Kaisa
    Laarni, Jari
    Kojo, Ilpo
    Sams, Mikko
    SPEECH COMMUNICATION, 2009, 51 (02) : 184 - 193
  • [8] An audiovisual test of kinematic primitives for visual speech perception
    Rosenblum, LD
    Saldana, HM
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1996, 22 (02) : 318 - 331
  • [9] Visual and Auditory Components in the Perception of Asynchronous Audiovisual Speech
    Garcia-Perez, Miguel A.
    Alcala-Quintana, Rocio
    I-PERCEPTION, 2015, 6 (06): : 1 - 20
  • [10] Visual Hearing Aids: Artificial Visual Speech Stimuli for Audiovisual Speech Perception in Noise
    Choudhary, Zubin Datta
    Bruder, Gerd
    Welch, Gregory F.
    29TH ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY, VRST 2023, 2023,