AUDIO-VISUAL DISCREPANCY AND THE INFLUENCE ON VERTICAL SOUND SOURCE LOCALIZATION

被引:0
|
作者
Werner, Stephan [1 ]
Liebetrau, Judith [1 ,2 ]
Sporer, Thomas
机构
[1] Ilmenau Univ Technol, Ilmenau, Germany
[2] Fraunhofer Inst Digital Media Technol, Ilmenau, Germany
关键词
Psychoacoustics; acoustic testing; binaural auralization; localization; ventriloquist-effect;
D O I
暂无
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Human audio perception is influenced by vision and vice versa. The effect and thresholds of perceptual fusion, for example the ventriloquism-effect, are well investigated for natural listening conditions in the horizontal plane. Modern reproduction approaches for realistic spatial audio, e.g. binaural reproduction, promise more realistic sound reproduction, though, including proper perception of direction, distance, and elevation. This raises the question if the thresholds of perceptual fusion in audio reproduction systems that consider elevation are the same as in natural listening conditions. To estimate the influence of audiovisual discrepancy on vertical sound source localization via binaural headphones, two experiments were conducted. Results show an effect of similar magnitude for the vertical and horizontal plane.
引用
收藏
页码:133 / 139
页数:7
相关论文
共 50 条
  • [21] Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization
    Rachavarapu, Kranthi Kumar
    Aakanksha, Aakanksha
    Sundaresha, Vignesh
    Rajagopalan, A. N.
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1910 - 1919
  • [22] Deep Audio-Visual Beamforming for Speaker Localization
    Qian, Xinyuan
    Zhang, Qiquan
    Guan, Guohui
    Xue, Wei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1132 - 1136
  • [23] Span-based Audio-Visual Localization
    Wu, Yiling
    Zhang, Xinfeng
    Wang, Yaowei
    Huang, Qingming
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1252 - 1260
  • [24] Audio-Visual Event Localization in Unconstrained Videos
    Tian, Yapeng
    Shi, Jing
    Li, Bochen
    Duan, Zhiyao
    Xu, Chenliang
    [J]. COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 : 252 - 268
  • [25] Podcast as a transmedia sound extension of audio-visual fiction
    Pedrero-Esteban, Luis-Miguel
    Terol-Bolinches, Raul
    Arense-Gomez, Alfredo
    [J]. REVISTA MEDITERRANEA COMUNICACION-JOURNAL OF COMMUNICATION, 2023, 14 (01): : 189 - 202
  • [26] Acoustic and Visual Knowledge Distillation for Contrastive Audio-Visual Localization
    Yaghoubi, Ehsan
    Kelm, Andre
    Gerkmann, Timo
    Frintrop, Simone
    [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, : 15 - 23
  • [27] Active Audio-Visual Separation of Dynamic Sound Sources
    Majumder, Sagnik
    Grauman, Kristen
    [J]. COMPUTER VISION, ECCV 2022, PT XXXIX, 2022, 13699 : 551 - 569
  • [28] The Audio-Visual BatVision Dataset for Research on Sight and Sound
    Brunetto, Amandine
    Hornauer, Sascha
    Yu, Stella X.
    Moutarde, Fabien
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5812 - 5819
  • [29] iQuery: Instruments as Queries for Audio-Visual Sound Separation
    Chen, Jiaben
    Zhang, Renrui
    Lian, Dongze
    Yang, Jiaqi
    Zeng, Ziyao
    Shi, Jianbo
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14675 - 14686
  • [30] Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning
    Sun, Weixuan
    Zhang, Jiayi
    Wang, Jianyuan
    Liu, Zheyuan
    Zhong, Yiran
    Feng, Tianpeng
    Guo, Yandong
    Zhang, Yanhao
    Barnes, Nick
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6420 - 6429