Speaker localization using direct path dominance test based on sound field directivity

被引:27
|
作者
Rafaely, Boaz [1 ]
Alhaiany, Koby [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Elect & Comp Engn, Beer Sheva, Israel
基金
以色列科学基金会;
关键词
Speaker localization; Reverberation; Spherical microphone arrays; Directivity; MULTIPLE; ARRAYS;
D O I
10.1016/j.sigpro.2017.08.010
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Estimation of the direction-of-arrival (DoA) of a speaker in a room is important in many audio signal processing applications. Environments with reverberation that masks the DoA information are particularly challenging. Recently, a DoA estimation method that is robust to reverberation has been developed. This method identifies time-frequency bins dominated by the contribution from the direct path, which carries the correct DoA information. However, its implementation is computationally demanding as it requires frequency smoothing to overcome the effect of coherent early reflections and matrix decomposition to apply the direct-path dominance (DPD) test. In this work, a novel computationally-efficient alternative to the DPD test is proposed, based on the directivity measure for sensor arrays, which requires neither frequency smoothing nor matrix decomposition, and which has been reformulated for sound field directivity with spherical microphone arrays. The paper presents the proposed method and a comparison to previous methods under a range of reverberation and noise conditions. Result demonstrate that the proposed method shows comparable performance to the original method in terms of robustness to reverberation and noise, and is about four times more computationally efficient for the given experiment. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:42 / 47
页数:6
相关论文
共 50 条
  • [21] Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization
    Yang, Bing
    Liu, Hong
    Li, Xiaofei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 3491 - 3503
  • [22] SUPERVISED DIRECT-PATH RELATIVE TRANSFER FUNCTION LEARNING FOR BINAURAL SOUND SOURCE LOCALIZATION
    Yang, Bing
    Li, Xiaofei
    Liu, Hong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 825 - 829
  • [23] Sound field reproduction system using narrow directivity microphones and boundary surface control principle
    Kashiwazaki, Hiroshi
    Omoto, Akira
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2018, 39 (04) : 295 - 304
  • [24] Direct-Path Signal Cross-Correlation Estimation for Sound Source Localization in Reverberation
    Xue, Wei
    Tong, Ying
    Ding, Guohong
    Zhang, Chao
    Ma, Tao
    He, Xiaodong
    Zhou, Bowen
    INTERSPEECH 2019, 2019, : 2693 - 2697
  • [25] Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization
    Li, Xiaofei
    Girin, Laurent
    Horaud, Radu
    Gannot, Sharon
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2171 - 2186
  • [26] Multiple sound source localization using gammatone auditory filtering and direct sound componence detection
    Chen, Huaiyu
    Cao, Li
    3RD INTERNATIONAL CONFERENCE ON ADVANCES IN ENERGY, ENVIRONMENT AND CHEMICAL ENGINEERING, 2017, 69
  • [27] Sound Source Localization Using Non-Conformal Surface Sound Field Transformation Based on Spherical Harmonic Wave Decomposition
    Zhang, Lanyue
    Ding, Dandan
    Yang, Desen
    Wang, Jia
    Shi, Jie
    SENSORS, 2017, 17 (05)
  • [28] Direct-path based fingerprint extraction algorithm for indoor localization
    Zhu, Dali
    Zhao, Bobai
    Wang, Siye
    Wu, Di
    PROCEEDINGS OF THE 15TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS 2018), 2018, : 11 - 18
  • [29] Fast Calculation of Far-Field Sound Directivity Based on Fast Multipole Boundary Element Method
    Masumoto, Takayuki
    Yasuda, Yosuke
    Inoue, Naohisa
    Sakuma, Tetsuya
    JOURNAL OF THEORETICAL AND COMPUTATIONAL ACOUSTICS, 2020, 28 (04):
  • [30] Speaker localization for far-field and near-field wideband sources using neural networks
    Arslan, G
    Sakarya, FA
    Evans, BL
    PROCEEDINGS OF THE IEEE-EURASIP WORKSHOP ON NONLINEAR SIGNAL AND IMAGE PROCESSING (NSIP'99), 1999, : 528 - 532