Robust auditory localization using probabilistic inference and coherence-based weighting of interaural cues

被引:8
|
作者
Kayser, Hendrik [1 ]
Hohmann, Volker
Ewert, Stephan D.
Kollmeier, Birger
Anemueller, Joern
机构
[1] Carl von Ossietzky Univ Oldenburg, Med Phys, D-26111 Oldenburg, Germany
来源
关键词
CORRELATION DISCRIMINATION; SOUND LOCALIZATION; SPEECH; MODEL; MASKING; NOISE; SENSITIVITY;
D O I
10.1121/1.4932588
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Robust sound source localization is performed by the human auditory system even in challenging acoustic conditions and in previously unencountered, complex scenarios. Here a computational binaural localization model is proposed that possesses mechanisms for handling of corrupted or unreliable localization cues and generalization across different acoustic situations. Central to the model is the use of interaural coherence, measured as interaural vector strength (IVS), to dynamically weight the importance of observed interaural phase (IPD) and level (ILD) differences in frequency bands up to 1.4 kHz. This is accomplished through formulation of a probabilistic model in which the ILD and IPD distributions pertaining to a specific source location are dependent on observed interaural coherence. Bayesian computation of the direction-of-arrival probability map naturally leads to coherence-weighted integration of location cues across frequency and time. Results confirm the model's validity through statistical analyses of interaural parameter values. Simulated localization experiments show that even data points with low reliability (i.e., low IVS) can be exploited to enhance localization performance. A temporal integration length of at least 200 ms is required to gain a benefit; this is in accordance with previous psychoacoustic findings on temporal integration of spatial cues in the human auditory system. (C) 2015 Acoustical Society of America.
引用
收藏
页码:2635 / 2648
页数:14
相关论文
共 50 条
  • [1] Generalizing inference rules in a coherence-based probabilistic default reasoning
    Gilio, Angelo
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2012, 53 (03) : 413 - 434
  • [2] Source localization in complex listening situations: Selection of binaural cues based on interaural coherence
    Faller, C
    Merimaa, J
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (05): : 3075 - 3089
  • [3] ROBUST RECOGNITION OF REVERBERANT AND NOISY SPEECH USING COHERENCE-BASED PROCESSING
    Menon, Anjali
    Kim, Chanwoo
    Stern, Richard M.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6775 - 6779
  • [4] Spectral Weighting of Monaural Cues for Auditory Localization in Sagittal Planes
    Llado, Pedro
    Majdak, Piotr
    Barumerli, Roberto
    Baumgartner, Robert
    TRENDS IN HEARING, 2025, 29
  • [5] ROBUST FULL-SPHERE BINAURAL SOUND SOURCE LOCALIZATION USING INTERAURAL AND SPECTRAL CUES
    Hammond, Benjamin R.
    Jackson, Philip J. B.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 421 - 425
  • [6] Coherence-Based Probabilistic Recovery Guarantees for Sparsely Corrupted Signals
    Bracher, Annina
    Pope, Graeme
    Studer, Christoph
    2012 IEEE INFORMATION THEORY WORKSHOP (ITW), 2012, : 307 - 311
  • [7] EM localization and separation using interaural level and phase cues
    Mandel, Michael I.
    Ellis, Daniel P. W.
    2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 57 - 60
  • [8] Robust Inference Using Inverse Probability Weighting
    Ma, Xinwei
    Wang, Jingshen
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1851 - 1860
  • [9] A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End
    May, Tobias
    van de Par, Steven
    Kohlrausch, Armin
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 1 - 13
  • [10] A Robust Coherence-Based Brain Connectivity Method with an Application to EEG Recordings
    Yan, Jiaqing
    Wen, Jianbin
    Wang, Yinghua
    Liu, Xianzeng
    Li, Xiaoli
    ADVANCES IN COGNITIVE NEURODYNAMICS (V), 2016, : 339 - 344