Binaural sound localization based on deep neural network and affinity propagation clustering in mismatched HRTF condition

Authors
Jing Wang
Jin Wang
Kai Qian
Xiang Xie
Jingming Kuang
Affiliation
[1] Beijing Institute of Technology
Keywords
Deep neural network; Clustering; Affinity propagation; Binaural localization
DOI
Not available
Abstract
Binaural sound source localization is an important and widely used perceptually based method, and many researchers have applied machine learning to it using head-related transfer functions (HRTFs). Because HRTFs depend closely on human physiological structure, they vary between individuals. Related machine learning studies to date tend to focus on binaural localization in reverberant or noisy environments, or in conditions with multiple simultaneously active sound sources. In contrast, the mismatched HRTF condition, in which the HRTFs used to generate the training and test sets differ, is rarely studied. This mismatch degrades localization performance. A basic solution is to introduce more data to improve generalization, but this requires a large amount of data, and simply increasing the data volume is data-inefficient. In this paper, we propose a data-efficient method based on a deep neural network (DNN) and clustering to improve binaural localization performance in the mismatched HRTF condition. Firstly, we analyze the relationship between binaural cues and sound source location with a classification DNN. Different HRTFs are used to generate the training sets and the test sets. On this basis, we study the localization performance of the DNN model trained on each training set when evaluated on the different test sets. The results show that the same model performs differently on different test sets, while different models may perform similarly on the same test set. The results also show a clustering trend. Secondly, the HRTFs are divided into several clusters. Finally, the HRTFs corresponding to the cluster centers are selected to generate a new training set, which is used to train a more generalized DNN model. The experimental results show that the proposed method achieves better generalization performance than the baseline methods in the mismatched HRTF condition and performs almost as well as a DNN trained with a large number of HRTFs, which means the proposed method is data-efficient.
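The clustering and center-selection steps described in the abstract can be sketched roughly as follows. This is a minimal pure-Python illustration of affinity propagation, not the authors' implementation. Several things are assumptions made only for the example: the cross-evaluation matrix `acc` is synthetic toy data (not results from the paper), each row is taken to be the accuracy profile of the model trained on one HRTF, similarity is the negative squared Euclidean distance between profiles, and the preference defaults to the median similarity.

```python
from statistics import median


def neg_sq_dist(u, v):
    """Similarity between two accuracy profiles (assumed measure)."""
    return -sum((a - b) ** 2 for a, b in zip(u, v))


def affinity_propagation(S, preference=None, damping=0.5, max_iter=300):
    """Cluster n >= 2 items from an n x n similarity matrix (list of lists).

    Returns (exemplars, labels): the exemplar indices and, for every item,
    the index of the exemplar it is assigned to.
    """
    n = len(S)
    S = [row[:] for row in S]  # work on a copy
    if preference is None:
        # Common default: median of the off-diagonal similarities.
        preference = median(S[i][k] for i in range(n) for k in range(n) if i != k)
    for k in range(n):
        S[k][k] = preference
    R = [[0.0] * n for _ in range(n)]  # responsibilities
    A = [[0.0] * n for _ in range(n)]  # availabilities
    for _ in range(max_iter):
        # Responsibility: how much better k is for i than i's best alternative.
        for i in range(n):
            for k in range(n):
                best_other = max(A[i][j] + S[i][j] for j in range(n) if j != k)
                R[i][k] = damping * R[i][k] + (1 - damping) * (S[i][k] - best_other)
        # Availability: accumulated support for k acting as an exemplar.
        for k in range(n):
            pos = [max(0.0, R[j][k]) for j in range(n)]
            total = sum(pos) - pos[k]  # positive support from all j != k
            for i in range(n):
                if i == k:
                    new = total
                else:
                    new = min(0.0, R[k][k] + total - pos[i])
                A[i][k] = damping * A[i][k] + (1 - damping) * new
    evidence = [R[k][k] + A[k][k] for k in range(n)]
    exemplars = [k for k in range(n) if evidence[k] > 0]
    if not exemplars:
        # Fallback if nothing stands out: keep the single strongest candidate.
        exemplars = [max(range(n), key=lambda k: evidence[k])]
    labels = [max(exemplars, key=lambda k: S[i][k]) for i in range(n)]
    for k in exemplars:
        labels[k] = k  # an exemplar always represents itself
    return exemplars, labels


# Synthetic toy data: acc[i][j] = accuracy of the model trained on
# hypothetical HRTF i, tested on data generated with hypothetical HRTF j.
acc = [
    [0.95, 0.88, 0.80, 0.40, 0.35, 0.30],
    [0.86, 0.96, 0.84, 0.42, 0.38, 0.33],
    [0.78, 0.85, 0.94, 0.50, 0.41, 0.36],
    [0.38, 0.43, 0.47, 0.95, 0.83, 0.79],
    [0.33, 0.36, 0.44, 0.85, 0.96, 0.87],
    [0.31, 0.35, 0.37, 0.77, 0.86, 0.93],
]
similarity = [[neg_sq_dist(u, v) for v in acc] for u in acc]
exemplars, labels = affinity_propagation(similarity)
print("HRTFs selected for the new training set:", exemplars)
print("cluster assignment per HRTF:", labels)
```

Under these assumptions, the returned `exemplars` play the role of the cluster centers whose HRTFs would be used to generate the new training set. How many clusters appear depends on the preference and damping values, so both would need tuning on real cross-evaluation results.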