A LEARNING-BASED APPROACH TO DIRECTION OF ARRIVAL ESTIMATION IN NOISY AND REVERBERANT ENVIRONMENTS

被引:0
|
作者
Xiao, Xiong [1 ]
Zhao, Shengkui [2 ]
Zhong, Xionghu [3 ]
Jones, Douglas L. [2 ]
Chng, Eng Siong [3 ]
Li, Haizhou [3 ,4 ]
机构
[1] Nanyang Technol Univ, Temasek Lab, Singapore, Singapore
[2] Adv Digital Sci Ctr, Singapore, Singapore
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
[4] Inst Infocomm Res, Dept Human Language Technol, Singapore, Singapore
关键词
microphone arrays; direction of arrival; least squares; machine learning; neural networks; HISTOGRAM EQUALIZATION; LOCALIZATION; ADAPTATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a learning-based approach to the task of direction of arrival estimation (DOA) from microphone array input. Traditional signal processing methods such as the classic least square (LS) method rely on strong assumptions on signal models and accurate estimations of time delay of arrival (TDOA). They only work well in relatively clean conditions, but suffer from noise and reverberation distortions. In this paper, we propose a learning-based approach that can learn from a large amount of simulated noisy and reverberant microphone array inputs for robust DOA estimation. Specifically, we extract features from the generalised cross correlation (GCC) vectors and use a multilayer perceptron neural network to learn the nonlinear mapping from such features to the DOA. One advantage of the learning based method is that as more and more training data becomes available, the DOA estimation will become more and more accurate. Experimental results on simulated data show that the proposed learning based method produces much better results than the state-of-the-art LS method. The testing results on real data recorded in meeting rooms show improved root-mean-square error (RMSE) compared to the LS method.
引用
收藏
页码:2814 / 2818
页数:5
相关论文
共 50 条
  • [31] Direction of Arrival Estimation Based on Minor Component Analysis Approach
    Cui Hao
    Li Donghai
    Zhao Yongjun
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION & INSTRUMENTATION, VOL. 3, 2008, : 1570 - 1574
  • [32] DIRECTION OF ARRIVAL ESTIMATION FOR REVERBERANT SPEECH BASED ON NEURAL NETWORKS AND THE DIRECT-PATH DOMINANCE TEST
    Ben Zaken, Orel
    Rafaely, Boaz
    Kumar, Anurag
    Tourbabin, Vladimir
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [33] A systematic study of DNN based speech enhancement in reverberant and reverberant-noisy environments
    Wang, Heming
    Pandey, Ashutosh
    Wang, Deliang
    COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [34] Deep learning-based direction-of-arrival estimation for multiple speech sources using a small scale arraya)
    Zhang, Min
    Pan, Xiang
    Shen, Yining
    Qiu, Jianjun
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 149 (06): : 3841 - 3850
  • [35] Subspace-based direction of arrival estimation in colored ambient noise environments
    Yang, Long
    Yang, Yixin
    Zhang, Yahao
    DIGITAL SIGNAL PROCESSING, 2020, 99
  • [36] Subspace-based direction of arrival estimation in colored ambient noise environments
    Yang, Yixin (yxyang@nwpu.edu.cn), 1600, Elsevier Inc. (99):
  • [37] Robust time delay estimation in noisy reverberant environments with a probabilistic grapifucal model
    Kim, T
    Attias, H
    Lee, SY
    Lee, TW
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 525 - 528
  • [38] MAXIMUM LIKELIHOOD ESTIMATION OF THE LATE REVERBERANT POWER SPECTRAL DENSITY IN NOISY ENVIRONMENTS
    Schwartz, Ofer
    Braun, Sebastian
    Gannot, Sharon
    Habets, Emanuel A. P.
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [39] SEISMIC DETECTION AND TIME OF ARRIVAL ESTIMATION IN NOISY ENVIRONMENTS BASED ON THE HAAR WAVELET TRANSFORM
    Thanasopoulos, Ioannis A.
    Avaritsiotis, John N.
    2008 IEEE SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP, 2008, : 434 - 437
  • [40] Theory of the cubic autoproduct and its utility for noisy direction of arrival estimation
    Joslyn, Nicholas J.
    Dowling, David R.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2024, 156 (03): : 1887 - 1902