The importance of time-frequency averaging for binaural speaker localization in reverberant environments

被引:1
|
作者
Beit-On, Hanan [1 ]
Tourbabin, Vladimir [2 ]
Rafaely, Boaz [1 ]
机构
[1] Ben Gurion Univ Negev, Sch Elect & Comp Engn, Beer Sheva, Israel
[2] Facebook Real Labs, Haifa, Israel
来源
关键词
D O I
10.21437/Interspeech.2020-2256
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
A common approach to overcoming the effect of reverberation in speaker localization is to identify the time-frequency (TF) bins in which the direct path is dominant, and then to use only these bins for estimation. Various direct-path dominance (DPD) tests have been proposed for identifying the direct-path bins. However, for a two-microphone binaural array, tests that do not employ averaging over TF bins seem to fail. In this paper, this anomaly is studied by comparing two DPD tests, in which only one has been designed to employ averaging over TF bins. An analysis of these tests shows that, in the binaural case, a TF bin that is dominated by multiple reflections may be similar to a bin with a single source. This insight can explain the high false alarm rate encountered with tests that do not employ averaging. Also, it is shown that incorporating averaging over TF bins can reduce the false alarm rate. A simulation study is presented that verifies the importance of TF averaging for a reliable selection of direct-path bins in the binaural case.
引用
收藏
页码:5071 / 5075
页数:5
相关论文
共 50 条
  • [1] Speaker Localization by Humanoid Robots in Reverberant Environments
    Tourbabin, Vladimir
    Rafaely, Boaz
    [J]. 2014 IEEE 28TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL (IEEEI), 2014,
  • [2] Cepstrum Prefiltering for Binaural Source Localization in Reverberant Environments
    Parisi, Raffaele
    Camoes, Flavia
    Scarpiniti, Michele
    Uncini, Aurelio
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (02) : 99 - 102
  • [3] Binaural Localization of Multiple Sources in Reverberant and Noisy Environments
    Woodruff, John
    Wang, DeLiang
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (05): : 1503 - 1512
  • [4] SPATIAL AND COHERENCE CUES BASED TIME-FREQUENCY MASKING FOR BINAURAL REVERBERANT SPEECH SEPARATION
    Alinaghi, Atiyeh
    Wang, Wenwu
    Jackson, Philip J. B.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 684 - 688
  • [5] Robust Adaptive Time Delay Estimation for Speaker Localization in Noisy and Reverberant Acoustic Environments
    Simon Doclo
    Marc Moonen
    [J]. EURASIP Journal on Advances in Signal Processing, 2003
  • [6] Robust adaptive time delay estimation for speaker localization in noisy and reverberant acoustic environments
    Doclo, S
    Moonen, M
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (11) : 1110 - 1124
  • [7] Imposition of Sparse Priors in Adaptive Time Delay Estimation for Speaker Localization in Reverberant Environments
    Cho, Ji-Won
    Park, Hyung-Min
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (1-3) : 180 - 183
  • [8] Binaural Target Sound Source Localization Based on Time-frequency Units Selection
    Li Ruwei
    Li Tao
    Sun Xiaoyue
    Yang Dengcai
    Wang Qi
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (12) : 2932 - 2938
  • [9] DIRECTION OF ARRIVAL ESTIMATION IN HIGHLY REVERBERANT ENVIRONMENTS USING SOFT TIME-FREQUENCY MASK
    Tourbabin, Vladimir
    Donley, Jacob
    Rafaely, Boaz
    Mehra, Ravish
    [J]. 2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 383 - 387
  • [10] Exploiting Structures of Temporal Causality for Robust Speaker Localization in Reverberant Environments
    Schymura, Christopher
    Guo, Peng
    Maymon, Yanir
    Rafaely, Boaz
    Kolossa, Dorothea
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 228 - 237