AUDIO-VISUAL DISCREPANCY AND THE INFLUENCE ON VERTICAL SOUND SOURCE LOCALIZATION

Cited by: 0

Authors:

Werner, Stephan [1]

Liebetrau, Judith [1,2]

Sporer, Thomas

Affiliations:

[1] Ilmenau Univ Technol, Ilmenau, Germany

[2] Fraunhofer Inst Digital Media Technol, Ilmenau, Germany

Source:

2012 Fourth International Workshop on Quality of Multimedia Experience (QoMEX) | 2012

Keywords:

Psychoacoustics; acoustic testing; binaural auralization; localization; ventriloquist-effect

DOI:

Not available

Chinese Library Classification (CLC) number:

TB8 [Photographic technology]

Discipline classification code:

0804

Abstract:

Human audio perception is influenced by vision and vice versa. The effects and thresholds of perceptual fusion, for example the ventriloquism effect, are well investigated for natural listening conditions in the horizontal plane. Modern approaches to spatial audio, e.g. binaural reproduction, promise more realistic sound reproduction, including proper perception of direction, distance, and elevation. This raises the question of whether the thresholds of perceptual fusion in audio reproduction systems that consider elevation are the same as in natural listening conditions. To estimate the influence of audiovisual discrepancy on vertical sound source localization via binaural headphones, two experiments were conducted. Results show an effect of similar magnitude for the vertical and horizontal planes.


Pages: 133-139

Page count: 7

