A LEARNING-BASED APPROACH TO DIRECTION OF ARRIVAL ESTIMATION IN NOISY AND REVERBERANT ENVIRONMENTS

Cited by: 0
Authors
Xiao, Xiong [1]
Zhao, Shengkui [2]
Zhong, Xionghu [3]
Jones, Douglas L. [2]
Chng, Eng Siong [3]
Li, Haizhou [3, 4]
Affiliations
[1] Nanyang Technol Univ, Temasek Lab, Singapore, Singapore
[2] Adv Digital Sci Ctr, Singapore, Singapore
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
[4] Inst Infocomm Res, Dept Human Language Technol, Singapore, Singapore
Keywords
microphone arrays; direction of arrival; least squares; machine learning; neural networks; histogram equalization; localization; adaptation
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper presents a learning-based approach to direction of arrival (DOA) estimation from microphone array input. Traditional signal processing methods, such as the classic least squares (LS) method, rely on strong assumptions about the signal model and on accurate estimates of the time delay of arrival (TDOA). They work well only in relatively clean conditions and degrade under noise and reverberation. In this paper, we propose a learning-based approach that learns from a large amount of simulated noisy and reverberant microphone array input for robust DOA estimation. Specifically, we extract features from the generalised cross correlation (GCC) vectors and use a multilayer perceptron neural network to learn the nonlinear mapping from these features to the DOA. One advantage of the learning-based method is that DOA estimation is expected to become more accurate as more training data becomes available. Experimental results on simulated data show that the proposed learning-based method produces much better results than the state-of-the-art LS method. Tests on real data recorded in meeting rooms show an improved root-mean-square error (RMSE) compared with the LS method.
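The feature extraction step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which GCC weighting, lag range, or frame length is used, so the PHAT weighting and the function names `gcc_phat` and `gcc_feature_vector` below are assumptions made for illustration only. The sketch computes a GCC vector for each microphone pair and concatenates them into one feature vector of the kind a multilayer perceptron could take as input.

```python
from itertools import combinations

import numpy as np


def gcc_phat(x, y, max_lag):
    """GCC values for lags -max_lag..max_lag (in samples), max_lag >= 1.

    PHAT weighting is an assumption; the abstract only says "GCC".
    A positive peak lag means x is delayed relative to y.
    """
    n = len(x) + len(y)  # zero-pad so the circular correlation does not wrap
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = X * np.conj(Y)
    cross /= np.abs(cross) + 1e-12  # keep phase only (PHAT)
    cc = np.fft.irfft(cross, n=n)
    # Negative lags sit at the end of cc; move them to the front.
    return np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))


def gcc_feature_vector(frame, max_lag):
    """Concatenate GCC vectors over all microphone pairs.

    frame has shape (num_mics, num_samples); this layout is hypothetical.
    """
    pairs = combinations(range(frame.shape[0]), 2)
    return np.concatenate(
        [gcc_phat(frame[i], frame[j], max_lag) for i, j in pairs]
    )
```

For a four-microphone frame and `max_lag=10`, `gcc_feature_vector` returns 6 pairs times 21 lags, that is 126 values. The lag of the largest value in each pair's GCC vector is the usual TDOA estimate that an LS method would consume, whereas a learning-based method can use the whole vector.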
Pages: 2814-2818
Number of pages: 5