Sound source localization method based time-domain signal feature using deep learning

被引:5
|
作者
Tang, Jun [1 ]
Sun, Xinmiao [1 ]
Yan, Lei [2 ]
Qu, Yang [1 ]
Wang, Tao [1 ]
Yue, Yuan [1 ]
机构
[1] Tianjin Univ, Sch Civil Engn, Tianjin 300072, Peoples R China
[2] China Acad Launch Vehicle Technol, Beijing 100076, Peoples R China
基金
国家重点研发计划;
关键词
Sound source localization; Microphone array; Time-domain features; Convolutional nerual network;
D O I
10.1016/j.apacoust.2023.109626
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep learning, as the most commonly used machine learning algorithm, is widely used in various fields. In the field of acoustics, deep learning methods are combined with frequency-domain features of signals to locate sound sources. The commonly frequency domain features include microphones array Cross-spectral-Matrix(CSM) and Short Time Fourier Transform(STFT). However, the use of frequency-domain features often leads to the loss of partial signal information and increases the computational complexity. This paper proposed a novel sound source localization algorithm based on time-domain features, which uses convolutional neural network(CNN) as a medium to achieve mapping from time-domain features to sound source locations. This method does not rely on any basic signal processing algorithm, and directly uses time-domain sampling points as network inputs for sound source localization. The application simulation shows that the proposed method can achieve precise localization and low side-lobe effect under different testing conditions. Once the network training is completed, the testing accuracy under different conditions is above 95%, with a maximum of 100%.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A Time-domain Unsupervised Learning Based Sound Source Localization Method
    Huang, Yankun
    Wu, Xihong
    Qu, Tianshu
    2020 IEEE 3RD INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP 2020), 2020, : 26 - 32
  • [2] A Time-domain End-to-End Method for Sound Source Localization Using Multi-Task Learning
    Huang, Yankun
    Wu, Xihong
    Qu, Tianshu
    2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 52 - 56
  • [3] LoRa Signal Demodulation Using Deep Learning, a Time-Domain Approach
    Dakic, Kosta
    Al Homssi, Bassel
    Al-Hourani, Akram
    Lech, Margaret
    2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING), 2021,
  • [4] Bayesian time-domain multiple sound source localization for a stochastic machine
    Frisch, Raphael
    Faix, Marvin
    Droulez, Jacques
    Girin, Laurent
    Mazer, Emmanuel
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [6] Sound source identification and sound radiation modeling in a moving medium using the time-domain equivalent source method
    Zhang, Xiao-Zheng
    Bi, Chuan-Xing
    Zhang, Yong-Bin
    Xu, Liang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (05): : 2678 - 2686
  • [7] Sound source localization using deep learning models
    Yalta N.
    Nakadai K.
    Ogata T.
    2017, Fuji Technology Press (29) : 37 - 48
  • [8] Impulsive sound source localization using peak and RMS estimation of the time-domain beamformer output
    Seo, Dae-Hoon
    Choi, Jung-Woo
    Kim, Yang-Hann
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2014, 49 (1-2) : 95 - 105
  • [9] Sound source feature defuzzification method based on end-to-end deep learning
    Feng L.
    Zan M.
    Xu Z.
    Zhang Z.
    Li Z.
    Zhendong yu Chongji/Journal of Vibration and Shock, 2023, 42 (21): : 133 - 141
  • [10] TIME-DOMAIN AUDIO SOURCE SEPARATION BASED ON GAUSSIAN PROCESSES WITH DEEP KERNEL LEARNING
    Nugraha, Aditya Arie
    Di Carlo, Diego
    Bando, Yoshiaki
    Fontaine, Mathieu
    Yoshii, Kazuyoshi
    2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA, 2023,