Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments

被引:87
|
作者
Ma, Ning [1 ]
May, Tobias [2 ]
Brown, Guy J. [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
[2] Tech Univ Denmark, Hearing Syst Grp, DK-2800 Lyngby, Denmark
关键词
Binaural sound source localisation; deep neural networks; head movements; machine hearing; multi-conditional training; reverberation; PROBABILISTIC MODEL; CUES;
D O I
10.1109/TASLP.2017.2750760
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a novel machine-hearing system that exploits deep neural networks (DNNs) and head movements for robust binaural localization of multiple sources in reverberant environments. DNNs are used to learn the relationship between the source azimuth and binaural cues, consisting of the complete cross-correlation function (CCF) and interaural level differences (ILDs). In contrast to many previous binaural hearing systems, the proposed approach is not restricted to localization of sound sources in the frontal hemifield. Due to the similarity of binaural cues in the frontal and rear hemifields, front-back confusions often occur. To address this, a head movement strategy is incorporated in the localization model to help reduce the front-back errors. The proposed DNN system is compared to a Gaussian-mixture-model-based system that employs interaural time differences (ITDs) and ILDs as localization features. Our experiments show that the DNN is able to exploit information in the CCF that is not available in the ITD cue, which together with head movements substantially improves localization accuracies under challenging acoustic scenarios, in which multiple talkers and room reverberation are present.
引用
下载
收藏
页码:2444 / 2453
页数:10
相关论文
共 38 条
  • [21] Deep Neural Networks for wireless localization in indoor and outdoor environments
    Zhang, Wei
    Liu, Kan
    Zhang, Weidong
    Zhang, Youmei
    Gu, Jason
    NEUROCOMPUTING, 2016, 194 : 279 - 287
  • [22] Deep Neural Networks for Multiple Speaker Detection and Localization
    He, Weipeng
    Motlicek, Petr
    Odobez, Jean-Marc
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 74 - 79
  • [23] A Binaural Deep Neural Networks Parameter Mask for the Robust Automatic Speech Recognition System
    Jiang, Yi
    Liu, Runsheng
    2016 INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC), 2016, : 352 - 356
  • [24] Two-Stage Monaural Source Separation in Reverberant Room Environments Using Deep Neural Networks
    Sun, Yang
    Wang, Wenwu
    Chambers, Jonathon
    Naqvi, Syed Mohsen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 125 - 139
  • [25] Simultaneous Localization of Multiple GNSS Interference Sources via Neural Networks
    Besson, David
    PROCEEDINGS OF THE 30TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2017), 2017, : 2812 - 2829
  • [26] Binaural lateral localization of multiple sources in real environments using a kurtosis-driven split-EM algorithm
    Reche-Lopez, P.
    Perez-Lorenzo, J. M.
    Rivas, F.
    Viciana-Abad, R.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 69 : 137 - 146
  • [27] EXPLOITING SYNCHRONY SPECTRA AND DEEP NEURAL NETWORKS FOR NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION
    Ma, Ning
    Marxer, Ricard
    Barker, Jon
    Brown, Guy J.
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 490 - 495
  • [28] Performance analysis of deep neural networks for direction of arrival estimation of multiple sources
    Chen, Min
    Mao, Xingpeng
    Wang, Xiuhong
    IET SIGNAL PROCESSING, 2023, 17 (03)
  • [29] Deep Multiple Instance Convolutional Neural Networks for Learning Robust Scene Representations
    Li, Zhili
    Xu, Kai
    Xie, Jiafen
    Bi, Qi
    Qin, Kun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (05): : 3685 - 3702
  • [30] Fast and robust multiple ColorChecker detection using deep convolutional neural networks
    Marrero Fernandez, Pedro D.
    Guerrero Pena, Fidel A.
    Ren, Tsang Ing
    Leandro, Jorge J. G.
    IMAGE AND VISION COMPUTING, 2019, 81 : 15 - 24