A LEARNING-BASED APPROACH TO DIRECTION OF ARRIVAL ESTIMATION IN NOISY AND REVERBERANT ENVIRONMENTS

被引:0
|
作者
Xiao, Xiong [1 ]
Zhao, Shengkui [2 ]
Zhong, Xionghu [3 ]
Jones, Douglas L. [2 ]
Chng, Eng Siong [3 ]
Li, Haizhou [3 ,4 ]
机构
[1] Nanyang Technol Univ, Temasek Lab, Singapore, Singapore
[2] Adv Digital Sci Ctr, Singapore, Singapore
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
[4] Inst Infocomm Res, Dept Human Language Technol, Singapore, Singapore
关键词
microphone arrays; direction of arrival; least squares; machine learning; neural networks; HISTOGRAM EQUALIZATION; LOCALIZATION; ADAPTATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a learning-based approach to the task of direction of arrival estimation (DOA) from microphone array input. Traditional signal processing methods such as the classic least square (LS) method rely on strong assumptions on signal models and accurate estimations of time delay of arrival (TDOA). They only work well in relatively clean conditions, but suffer from noise and reverberation distortions. In this paper, we propose a learning-based approach that can learn from a large amount of simulated noisy and reverberant microphone array inputs for robust DOA estimation. Specifically, we extract features from the generalised cross correlation (GCC) vectors and use a multilayer perceptron neural network to learn the nonlinear mapping from such features to the DOA. One advantage of the learning based method is that as more and more training data becomes available, the DOA estimation will become more and more accurate. Experimental results on simulated data show that the proposed learning based method produces much better results than the state-of-the-art LS method. The testing results on real data recorded in meeting rooms show improved root-mean-square error (RMSE) compared to the LS method.
引用
收藏
页码:2814 / 2818
页数:5
相关论文
共 50 条
  • [21] Direction of arrival estimation using matching pursuit under reverberant condition
    Oikawa, Yasuhiro
    Yamasaki, Yoshio
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2005, 26 (04) : 365 - 367
  • [22] Decoupled Multiple Speaker Direction-of-Arrival Estimator Under Reverberant Environments
    Hu, Yonggang
    Samarasinghe, Prasanga N.
    Gannot, Sharon
    Abhayapala, Thushara D.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 3120 - 3133
  • [23] Direction of Arrival Estimation in Urban Multipath Environments
    Greenberg, Eran
    Naor, Menahem
    2016 10TH EUROPEAN CONFERENCE ON ANTENNAS AND PROPAGATION (EUCAP), 2016,
  • [24] Deep Learning Based Multi-Channel Speaker Recognition in Noisy and Reverberant Environments
    Taherian, Hassan
    Wang, Zhong-Qiu
    Wane, DeLiang
    INTERSPEECH 2019, 2019, : 4070 - 4074
  • [25] Sparse Iterative Beamforming Using Spherical Microphone Arrays for Low-Latency Direction of Arrival Estimation in Reverberant Environments
    Mathews, Jonathan
    Braasch, Jonas
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2021, 69 (12): : 967 - 977
  • [26] Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation
    Yun, Deokgyu
    Choi, Seung Ho
    SENSORS, 2022, 22 (02)
  • [27] Wideband Direction-of-Arrival Estimation Based on Deep Learning
    Liya Xu
    Yi Ma
    Jinfeng Zhang
    Bin Liao
    Journal of Beijing Institute of Technology, 2021, 30 (04) : 412 - 424
  • [28] Wideband Direction-of-Arrival Estimation Based on Deep Learning
    Xu L.
    Ma Y.
    Zhang J.
    Liao B.
    Zhang, Jinfeng (zhangjf@szu.edu.cn), 1600, Beijing Institute of Technology (30): : 412 - 424
  • [29] Direction of Arrival Estimation for Radionuclides Based on Neural Network Approach
    Yossi, Salomon
    Eran, Vax
    Yakir, Knafo
    Nadav, Ben David
    Alon, Osovizky
    Dan, Vilenchik
    IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2024, 71 (05) : 1124 - 1133
  • [30] Direction of arrival estimation based on minor component analysis approach
    Li, Donghai
    Gao, Shihai
    Wang, Feng
    Meng, Fankun
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 515 - 522