Binaural sound localization based on deep neural network and affinity propagation clustering in mismatched HRTF condition

被引:0
|
作者
Jing Wang
Jin Wang
Kai Qian
Xiang Xie
Jingming Kuang
机构
[1] Beijing Institute of Technology,
关键词
Deep neural network; Clustering; Affinity propagation; Binaural localization;
D O I
暂无
中图分类号
学科分类号
摘要
Binaural sound source localization is an important and widely used perceptually based method and it has been applied to machine learning studies by many researchers based on head-related transfer function (HRTF). Because the HRTF is closely related to human physiological structure, the HRTFs vary between individuals. Related machine learning studies to date tend to focus on binaural localization in reverberant or noisy environments, or in conditions with multiple simultaneously active sound sources. In contrast, mismatched HRTF condition, in which the HRTFs used to generate the training and test sets are different, is rarely studied. This mismatch leads to a degradation of localization performance. A basic solution to this problem is to introduce more data to improve generalization performance, which requires a lot. However, simply increasing the data volume will result in data-inefficiency. In this paper, we propose a data-efficient method based on deep neural network (DNN) and clustering to improve binaural localization performance in the mismatched HRTF condition. Firstly, we analyze the relationship between binaural cues and the sound source localization with a classification DNN. Different HRTFs are used to generate training and test sets, respectively. On this basis, we study the localization performance of DNN model trained by each training set on different test sets. The result shows that the localization performance of the same model on different test sets is different, while the localization performance of different models on the same test set may be similar. The result also shows a clustering trend. Secondly, different HRTFs are divided into several clusters. Finally, the corresponding HRTFs of each cluster center are selected to generate a new training set and to train a more generalized DNN model. The experimental results show that the proposed method achieves better generalization performance than the baseline methods in the mismatched HRTF condition and has almost equal performance to the DNN trained with a large number of HRTFs, which means the proposed method is data-efficient.
引用
收藏
相关论文
共 50 条
  • [41] Deep Neural Network-Based Fusion Localization Using Smartphones
    Yan, Suqing
    Su, Yalan
    Xiao, Jianming
    Luo, Xiaonan
    Ji, Yuanfa
    Bin Ghazali, Kamarul Hawari
    SENSORS, 2023, 23 (21)
  • [42] Localization of Internet of Things Network via Deep Neural Network Based Matrix Completion
    Kim, Sunwoo
    Shim, Byonghyo
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 1766 - 1770
  • [43] Filterbank Learning for Deep Neural Network Based Polyphonic Sound Event Detection
    Cakir, Emre
    Ozan, Ezgi Can
    Virtanen, Tuomas
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3399 - 3406
  • [44] Sound event classification using deep neural network based transfer learning
    Lim, Hyungjun
    Kim, Myung Jong
    Kim, Hoirin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2016, 35 (02): : 143 - 148
  • [45] Combined deep belief network in deep learning with affinity propagation clustering algorithm for roller bearings fault diagnosis without data label
    Xu, Fan
    Tse, Peter W.
    JOURNAL OF VIBRATION AND CONTROL, 2019, 25 (02) : 473 - 482
  • [46] The Construction of Sound Speed Field Based on Back Propagation Neural Network in the Global Ocean
    Wang, Junting
    Xu, Tianhe
    Nie, Wenfeng
    Yu, Xiaokang
    MARINE GEODESY, 2020, 43 (06) : 621 - 642
  • [47] Fuzzy Clustering and Deep Neural Network-Based Image Segmentation Algorithm
    Lin, Zhi-jie
    Zhang, Shi-jing
    COMPUTER SCIENCE AND TECHNOLOGY (CST2016), 2017, : 711 - 717
  • [48] A Deep Clustering Algorithm Based on Self-organizing Map Neural Network
    Tao, Yanling
    Li, Ying
    Lin, Xianghong
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2018, PT III, 2018, 10956 : 182 - 192
  • [49] Clustering Based Dose Distribution Prediction Using Deep Convolutional Neural Network
    Ma, M.
    Buyyounouski, M.
    Vasudevan, V.
    Xing, L.
    Yang, Y.
    MEDICAL PHYSICS, 2019, 46 (06) : E358 - E358
  • [50] DNC: A Deep Neural Network-based Clustering-oriented Network Embedding Algorithm
    Li, Bentian
    Pi, Dechang
    Lin, Yunxia
    Cui, Lin
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2021, 173