DIRECTION OF ARRIVAL ESTIMATION FOR REVERBERANT SPEECH BASED ON NEURAL NETWORKS AND THE DIRECT-PATH DOMINANCE TEST

被引:0
|
作者
Ben Zaken, Orel [1 ]
Rafaely, Boaz [1 ]
Kumar, Anurag [2 ]
Tourbabin, Vladimir [2 ]
机构
[1] Ben Gurion Univ Negev, Sch Elect & Comp Engn, Beer Sheva, Israel
[2] Real Labs Res Meta, 1 Hacker Way, Menlo Pk, CA 94025 USA
关键词
Speaker localization; spherical arrays; machine learning; LOCALIZATION;
D O I
10.1109/IWAENC53105.2022.9914696
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In reverberant environments, typical of real-world scenarios, direction of arrival (DOA) estimation for speech sources appears to be a challenging problem in audio signal processing. An effective way of overcoming this challenge is to perform a direct-path dominance (DPD) test. The DPD test identifies time frequency bins dominated by the direct sound and holds accurate DOA data. In recent years, methods based on neural networks (NN) have been developed to estimate DOA. Based on the latter approach, this work proposes a NN based method, for spherical arrays, that is a generalization of the original DPD test method and aims to improve its performance by utilizing new information in the data, while preserving its advantages. This article presents the results of the proposed method for a single speaker in a room, and analyzes which features contain useful information about the direct sound by evaluating performance for simulated data.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] U-net Based Direct-path Dominance Test for Robust Direction-of-arrival Estimation
    Wang, Hao
    Chen, Kai
    Lu, Jing
    [J]. INTERSPEECH 2020, 2020, : 5086 - 5090
  • [2] DIRECTION OF ARRIVAL ESTIMATION USING PSEUDO-INTENSITY VECTORS WITH DIRECT-PATH DOMINANCE TEST
    Moore, Alastair H.
    Evers, Christine
    Naylor, Patrick A.
    Alon, David L.
    Rafaely, Boaz
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2296 - 2300
  • [3] Improved Direct-path Dominance Test for Speaker Localization in Reverberant Environments
    Madmoni, Lior
    Rafaely, Boaz
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2424 - 2428
  • [4] Direction of Arrival Estimation for Reverberant Speech Based on Enhanced Decomposition of the Direct Sound
    Madmoni, Lior
    Rafaely, Boaz
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) : 131 - 142
  • [5] Robust direction-of-arrival estimation for a target speaker based on multi-task U-net based direct-path dominance test
    Wang, Hao
    Gu, Zhaoyi
    Chen, Kai
    Lu, Jing
    [J]. JASA EXPRESS LETTERS, 2021, 1 (02):
  • [6] Robust direction of arrival estimation for speech enhancement in noisy reverberant rooms
    Lotter, T
    Loellmann, HW
    Vary, P
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4186 - 4186
  • [7] SPEAKER LOCALIZATION IN REVERBERANT ROOMS BASED ON DIRECT PATH DOMINANCE TEST STATISTICS
    Rafaely, Boaz
    Kolossa, Dorothea
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 6120 - 6124
  • [8] Neural-Network-Based Direction-of-Arrival Estimation for Reverberant Speech - The Importance of Energetic, Temporal, and Spatial Information
    Ben Zaken, Orel
    Kumar, Anurag
    Tourbabin, Vladimir
    Rafaely, Boaz
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1298 - 1309
  • [9] Speaker localization using the direct-path dominance test for arbitrary arrays
    Beit-On, Hanan
    Rafaely, Boaz
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING IN ISRAEL (ICSEE), 2018,
  • [10] Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function
    Li, Xiaofei
    Girin, Laurent
    Badeig, Fabien
    Horaud, Radu
    [J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 2819 - 2826