Degraded visual and auditory input individually impair audiovisual emotion recognition from speech-like stimuli, but no evidence for an exacerbated effect from combined degradation

Cited by: 8
Authors
de Boer, Minke J. [1, 2, 3]
Juergens, Tim [4]
Cornelissen, Frans W. [1, 2]
Baskent, Deniz [1, 3]
Affiliations
[1] Univ Groningen, Res Sch Behav & Cognit Neurosci BCN, Groningen, Netherlands
[2] Univ Groningen, Univ Med Ctr Groningen, Lab Expt Ophthalmol, Groningen, Netherlands
[3] Univ Groningen, Univ Med Ctr Groningen, Dept Otorhinolaryngol Head & Neck Surg, Groningen, Netherlands
[4] TH Lubeck, Inst Acoust, Lubeck, Germany
Keywords
Emotion perception; Eye-tracking; Central scotoma; Age-related hearing loss; Audiovisual; Dynamic; HEARING-LOSS; MACULAR DEGENERATION; CENTRAL SCOTOMAS; EYE-MOVEMENTS; LOW-VISION; AGE; PERCEPTION; SEARCH; PERFORMANCE; INTEGRATION;
DOI
10.1016/j.visres.2020.12.002
Chinese Library Classification (CLC) number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
Emotion recognition requires optimal integration of the multisensory signals from vision and hearing. A sensory loss in either or both modalities can lead to changes in integration and related perceptual strategies. To investigate potential acute effects of combined impairments due to sensory information loss only, we degraded the visual and auditory information in audiovisual video recordings and presented these to a group of healthy young volunteers. These degradations were intended to approximate some aspects of vision and hearing impairment in simulation. Other aspects, related to advanced age and potential health issues, as well as long-term adaptation and cognitive compensation strategies, were not included in the simulations. Besides accuracy of emotion recognition, eye movements were recorded to capture perceptual strategies. Our data show that emotion recognition performance decreases when degraded visual and auditory information are presented in isolation, but simultaneously degrading both modalities does not exacerbate these isolated effects. Moreover, degrading the visual information strongly impacts both recognition performance and viewing behavior. In contrast, degrading auditory information alongside normal or degraded video had little (additional) effect on performance or gaze. Nevertheless, our results hold promise for visually impaired individuals, because the addition of any audio to any video greatly facilitates performance, even though adding audio does not completely compensate for the negative effects of video degradation. Additionally, observers adapted their viewing behavior to degraded video in order to maximize their performance. Therefore, optimizing the hearing of visually impaired individuals and teaching them such optimized viewing behavior could be worthwhile endeavors for improving emotion recognition.
Pages: 51-62
Number of pages: 12