DIRECTION OF ARRIVAL ESTIMATION FOR REVERBERANT SPEECH BASED ON NEURAL NETWORKS AND THE DIRECT-PATH DOMINANCE TEST

被引:0
|
作者
Ben Zaken, Orel [1 ]
Rafaely, Boaz [1 ]
Kumar, Anurag [2 ]
Tourbabin, Vladimir [2 ]
机构
[1] Ben Gurion Univ Negev, Sch Elect & Comp Engn, Beer Sheva, Israel
[2] Real Labs Res Meta, 1 Hacker Way, Menlo Pk, CA 94025 USA
关键词
Speaker localization; spherical arrays; machine learning; LOCALIZATION;
D O I
10.1109/IWAENC53105.2022.9914696
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In reverberant environments, typical of real-world scenarios, direction of arrival (DOA) estimation for speech sources appears to be a challenging problem in audio signal processing. An effective way of overcoming this challenge is to perform a direct-path dominance (DPD) test. The DPD test identifies time frequency bins dominated by the direct sound and holds accurate DOA data. In recent years, methods based on neural networks (NN) have been developed to estimate DOA. Based on the latter approach, this work proposes a NN based method, for spherical arrays, that is a generalization of the original DPD test method and aims to improve its performance by utilizing new information in the data, while preserving its advantages. This article presents the results of the proposed method for a single speaker in a room, and analyzes which features contain useful information about the direct sound by evaluating performance for simulated data.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Performance analysis of deep neural networks for direction of arrival estimation of multiple sources
    Chen, Min
    Mao, Xingpeng
    Wang, Xiuhong
    [J]. IET SIGNAL PROCESSING, 2023, 17 (03)
  • [32] Direction-of-Arrival Estimation With A Vector Sensor Using Deep Neural Networks
    Yu, Jianyuan
    Howard, William W.
    Tait, Daniel
    Buehrer, R. Michael
    [J]. 2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING), 2021,
  • [33] Regression and Classification for Direction-of-Arrival Estimation with Convolutional Recurrent Neural Networks
    Tang, Zhenyu
    Kanu, John D.
    Hogan, Kevin
    Manocha, Dinesh
    [J]. INTERSPEECH 2019, 2019, : 654 - 658
  • [34] Improved direction-of-arrival estimation method based on LSTM neural networks with robustness to array imperfections
    Xiang, Houhong
    Chen, Baixiao
    Yang, Minglei
    Xu, Saiqin
    Li, Zhengjie
    [J]. APPLIED INTELLIGENCE, 2021, 51 (07) : 4420 - 4433
  • [35] Improved direction-of-arrival estimation method based on LSTM neural networks with robustness to array imperfections
    Houhong Xiang
    Baixiao Chen
    Minglei Yang
    Saiqin Xu
    Zhengjie Li
    [J]. Applied Intelligence, 2021, 51 : 4420 - 4433
  • [36] Neural Network based Direction of Arrival Estimation for a MIMO OFDM Radar
    Sit, Yoke Leen
    Agatonovic, Marija
    Zwick, Thomas
    [J]. 2012 9TH EUROPEAN RADAR CONFERENCE (EURAD), 2012, : 298 - 301
  • [37] Direction of Arrival Estimation of Array Defects Based on Deep Neural Network
    Li, Jianxiong
    Shao, Xingkai
    Li, Jie
    Ge, Lijun
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (09) : 4906 - 4927
  • [38] Direction of arrival estimation in passive radar based on deep neural network
    Lyu, Xiaoyong
    Wang, Jun
    [J]. IET SIGNAL PROCESSING, 2021, 15 (09) : 612 - 621
  • [39] Direction of Arrival Estimation of Array Defects Based on Deep Neural Network
    Jianxiong Li
    Xingkai Shao
    Jie Li
    Lijun Ge
    [J]. Circuits, Systems, and Signal Processing, 2022, 41 : 4906 - 4927
  • [40] Estimation of the Acoustic Time Delay of Arrival by Adaptive Eigenvalue Decomposition with a Proportionate Step-Size Control and Direct-Path Constraint
    Lee, Seokjin
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (08) : 1622 - 1627