Sound source localization method based time-domain signal feature using deep learning

被引:5
|
作者
Tang, Jun [1 ]
Sun, Xinmiao [1 ]
Yan, Lei [2 ]
Qu, Yang [1 ]
Wang, Tao [1 ]
Yue, Yuan [1 ]
机构
[1] Tianjin Univ, Sch Civil Engn, Tianjin 300072, Peoples R China
[2] China Acad Launch Vehicle Technol, Beijing 100076, Peoples R China
基金
国家重点研发计划;
关键词
Sound source localization; Microphone array; Time-domain features; Convolutional nerual network;
D O I
10.1016/j.apacoust.2023.109626
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep learning, as the most commonly used machine learning algorithm, is widely used in various fields. In the field of acoustics, deep learning methods are combined with frequency-domain features of signals to locate sound sources. The commonly frequency domain features include microphones array Cross-spectral-Matrix(CSM) and Short Time Fourier Transform(STFT). However, the use of frequency-domain features often leads to the loss of partial signal information and increases the computational complexity. This paper proposed a novel sound source localization algorithm based on time-domain features, which uses convolutional neural network(CNN) as a medium to achieve mapping from time-domain features to sound source locations. This method does not rely on any basic signal processing algorithm, and directly uses time-domain sampling points as network inputs for sound source localization. The application simulation shows that the proposed method can achieve precise localization and low side-lobe effect under different testing conditions. Once the network training is completed, the testing accuracy under different conditions is above 95%, with a maximum of 100%.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Computationally efficient transparent sound source for the finite-difference time-domain method
    Toyoda, Masahiro
    Yatabe, Kohei
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2023, 44 (05) : 371 - 382
  • [22] Three-dimensional grid-free sound source localization method based on deep learning
    Zhao, Yunjie
    He, Yansong
    Chen, Hao
    Zhang, Zhifei
    Xu, Zhongming
    APPLIED ACOUSTICS, 2025, 227
  • [23] A DEEP NEURAL NETWORK FOR TIME-DOMAIN SIGNAL RECONSTRUCTION
    Wang, Yuxuan
    Wang, DeLiang
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4390 - 4394
  • [24] Uncertainty Estimation for Sound Source Localization With Deep Learning
    Pi, Rendong
    Yu, Xiang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [25] A survey of sound source localization with deep learning methods
    Grumiaux, Pierre-Amaury
    Kitic, Srdan
    Girin, Laurent
    Guerin, Alexandre
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 152 (01): : 107 - 151
  • [26] Separation of non-stationary multi-source sound field based on the interpolated time-domain equivalent source method
    Bi, Chuan-Xing
    Geng, Lin
    Zhang, Xiao-Zheng
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2016, 72-73 : 745 - 761
  • [27] Application of deep learning for accurate source localization using sound intensity vector
    Jeong, Iljoo
    Jung, In-Jee
    Lee, Seungchul
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2024, 43 (01): : 72 - 77
  • [28] TIME-DOMAIN GENERALIZED CROSS CORRELATION PHASE TRANSFORM SOUND SOURCE LOCALIZATION FOR SMALL MICROPHONE ARRAYS
    Van den Broeck, Bert
    Bertrand, Alexander
    Karsmakers, Peter
    Vanrumste, Bart
    Van Hamme, Hugo
    Moonen, Marc
    2012 5TH EUROPEAN DSP EDUCATION AND RESEARCH CONFERENCE (EDERC), 2012, : 76 - 80
  • [29] Advanced Spectroscopy Time-Domain Signal Simulator for the Development of Machine and Deep Learning Algorithms
    Bykhovsky, Dima
    Chen, Zikang
    Huang, Yiwei
    Zheng, Xiaoying
    Trigano, Tom
    IEEE SENSORS LETTERS, 2025, 9 (04)
  • [30] Research of Terahertz Time-Domain Spectral Identification Based on Deep Learning
    Hu Qi-feng
    Cai Jian
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2021, 41 (01) : 94 - 99