Speaker localization using direct path dominance test based on sound field directivity

Cited by: 27
Authors
Rafaely, Boaz [1]
Alhaiany, Koby [1]
Affiliations
[1] Ben Gurion Univ Negev, Dept Elect & Comp Engn, Beer Sheva, Israel
Funding
Israel Science Foundation
Keywords
Speaker localization; Reverberation; Spherical microphone arrays; Directivity; MULTIPLE; ARRAYS;
DOI
10.1016/j.sigpro.2017.08.010
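Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809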
Abstract
Estimation of the direction-of-arrival (DoA) of a speaker in a room is important in many audio signal processing applications. Environments with reverberation that masks the DoA information are particularly challenging. Recently, a DoA estimation method that is robust to reverberation has been developed. This method identifies time-frequency bins dominated by the contribution from the direct path, which carries the correct DoA information. However, its implementation is computationally demanding, as it requires frequency smoothing to overcome the effect of coherent early reflections and matrix decomposition to apply the direct-path dominance (DPD) test. In this work, a novel, computationally efficient alternative to the DPD test is proposed. It is based on the directivity measure for sensor arrays, reformulated here for sound field directivity with spherical microphone arrays, and requires neither frequency smoothing nor matrix decomposition. The paper presents the proposed method and compares it to previous methods under a range of reverberation and noise conditions. Results demonstrate that the proposed method performs comparably to the original method in terms of robustness to reverberation and noise, and is about four times more computationally efficient for the given experiment. (C) 2017 Elsevier B.V. All rights reserved.
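The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea of a directivity-based bin test, not the authors' implementation. It assumes that each time-frequency bin already has plane-wave amplitudes sampled on a nearly uniform grid of look directions, and it uses the classical peak-to-mean power ratio as the directivity measure. The function names, the toy data, and the threshold value of 4.0 are all hypothetical.

```python
def directivity(amplitudes):
    """Peak-to-mean power ratio over look directions for one bin.

    Assumption: `amplitudes` are (possibly complex) plane-wave amplitudes
    sampled on a nearly uniform grid of directions.
    """
    powers = [abs(a) ** 2 for a in amplitudes]
    mean_power = sum(powers) / len(powers)
    if mean_power == 0.0:
        return 0.0  # silent bin: no directional information
    return max(powers) / mean_power


def select_direct_path_bins(bins, threshold):
    """Keep bins whose directivity reaches the threshold.

    Returns a list of (bin_key, index_of_peak_direction) pairs.
    """
    selected = []
    for key, amplitudes in bins.items():
        if directivity(amplitudes) >= threshold:
            powers = [abs(a) ** 2 for a in amplitudes]
            selected.append((key, powers.index(max(powers))))
    return selected


# Toy data (hypothetical): 8 look directions, two time-frequency bins.
bins = {
    ("t0", "f0"): [0.1, 0.1, 3.0, 0.1, 0.1, 0.1, 0.1, 0.1],  # one dominant direction
    ("t0", "f1"): [1.0, 0.9, 1.1, 1.0, 0.8, 1.2, 1.0, 0.9],  # diffuse-like field
}
result = select_direct_path_bins(bins, threshold=4.0)
```

In this toy case only the first bin passes the test, and its peak direction index serves as the DoA estimate for that bin. The sketch involves no frequency smoothing and no matrix decomposition, which is the property the abstract highlights; how the paper computes directivity from spherical microphone array signals is not specified here.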
Pages: 42-47
Number of pages: 6
Related papers
50 in total
  • [1] Speaker localization using the direct-path dominance test for arbitrary arrays
    Beit-On, Hanan
    Rafaely, Boaz
    2018 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING IN ISRAEL (ICSEE), 2018,
  • [2] SPEAKER LOCALIZATION IN REVERBERANT ROOMS BASED ON DIRECT PATH DOMINANCE TEST STATISTICS
    Rafaely, Boaz
    Kolossa, Dorothea
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 6120 - 6124
  • [3] Improved Direct-path Dominance Test for Speaker Localization in Reverberant Environments
    Madmoni, Lior
    Rafaely, Boaz
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2424 - 2428
  • [4] Sound source localization based on directivity of MEMS microphones
    Wu, XM
    Ren, TL
    Liu, LT
    2004: 7TH INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUITS TECHNOLOGY, VOLS 1- 3, PROCEEDINGS, 2004, : 1884 - 1887
  • [5] Localization of Multiple Speakers under High Reverberation using a Spherical Microphone Array and the Direct-Path Dominance Test
    Nadiri, O.
    Rafaely, B.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) : 1494 - 1505
  • [6] Robust direction-of-arrival estimation for a target speaker based on multi-task U-net based direct-path dominance test
    Wang, Hao
    Gu, Zhaoyi
    Chen, Kai
    Lu, Jing
    JASA EXPRESS LETTERS, 2021, 1 (02):
  • [7] Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function
    Li, Xiaofei
    Girin, Laurent
    Badeig, Fabien
    Horaud, Radu
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 2819 - 2826
  • [8] Automatic speaker tracking by camera using two-channel-based sound source localization
    Sayoud, Halim
    Ouamour, Siham
    Khennouf, Salah
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2011, 4 (01) : 40 - 60
  • [9] DIRECTION OF ARRIVAL ESTIMATION USING PSEUDO-INTENSITY VECTORS WITH DIRECT-PATH DOMINANCE TEST
    Moore, Alastair H.
    Evers, Christine
    Naylor, Patrick A.
    Alon, David L.
    Rafaely, Boaz
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2296 - 2300
  • [10] Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization With Spatial Sparsity Regularization
    Li, Xiaofei
    Girin, Laurent
    Horaud, Radu
    Gannot, Sharon
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (10) : 1997 - 2012