Sound source localization method based time-domain signal feature using deep learning

Cited by: 5
Authors
Tang, Jun [1]
Sun, Xinmiao [1]
Yan, Lei [2]
Qu, Yang [1]
Wang, Tao [1]
Yue, Yuan [1]
Affiliations
[1] Tianjin Univ, Sch Civil Engn, Tianjin 300072, Peoples R China
[2] China Acad Launch Vehicle Technol, Beijing 100076, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
Sound source localization; Microphone array; Time-domain features; Convolutional neural network
DOI
10.1016/j.apacoust.2023.109626
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Deep learning, one of the most commonly used families of machine learning algorithms, is widely applied across many fields. In acoustics, deep learning methods are typically combined with frequency-domain features of the signals to locate sound sources. Commonly used frequency-domain features include the microphone-array cross-spectral matrix (CSM) and the short-time Fourier transform (STFT). However, frequency-domain features often lose part of the signal information and increase computational complexity. This paper proposes a novel sound source localization algorithm based on time-domain features, which uses a convolutional neural network (CNN) to map time-domain features to sound source locations. The method does not rely on any basic signal processing algorithm and directly uses time-domain sampling points as network inputs for sound source localization. Simulations show that the proposed method achieves precise localization with low side lobes under different testing conditions. Once network training is completed, the testing accuracy under different conditions is above 95%, with a maximum of 100%.
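To make the contrast in the abstract concrete, the following is a minimal NumPy sketch (not the authors' code) of the two kinds of network input: the raw time-domain samples of a microphone array, and a cross-spectral matrix (CSM) computed from them by block-averaged FFTs. The array geometry, source position, tone frequency, sampling rate, and the function names `simulate_array_signals` and `cross_spectral_matrix` are all assumptions chosen for illustration; the paper's actual simulation setup and CNN architecture are not given in this record.

```python
import numpy as np


def simulate_array_signals(mic_xy, src_xyz, freq=2000.0, fs=51200.0,
                           n_samples=1024, c=343.0):
    """Simulate a single-tone monopole recorded by a planar array at z = 0.

    All parameter values are illustrative assumptions, not the paper's setup.
    Returns an array of shape (n_mics, n_samples): the raw time-domain input.
    """
    mic_xy = np.asarray(mic_xy, dtype=float)
    mic_xyz = np.column_stack([mic_xy, np.zeros(len(mic_xy))])
    # Distance from the source to each microphone, shape (n_mics,)
    dist = np.linalg.norm(mic_xyz - np.asarray(src_xyz, dtype=float), axis=1)
    t = np.arange(n_samples) / fs
    # Each channel is the tone delayed by dist / c and attenuated by 1 / dist
    delayed = t[None, :] - dist[:, None] / c
    return np.sin(2.0 * np.pi * freq * delayed) / dist[:, None]


def cross_spectral_matrix(signals, block_len=256):
    """Block-averaged CSM: csm[f, i, j] = mean_b X_i(f, b) * conj(X_j(f, b)).

    This is the extra frequency-domain step that a time-domain method skips.
    Returns a complex array of shape (n_freqs, n_mics, n_mics).
    """
    n_mics, n_samples = signals.shape
    n_blocks = n_samples // block_len
    blocks = signals[:, :n_blocks * block_len].reshape(n_mics, n_blocks, block_len)
    spectra = np.fft.rfft(blocks * np.hanning(block_len), axis=-1)
    return np.einsum("ibf,jbf->fij", spectra, spectra.conj()) / n_blocks


# Hypothetical 4-microphone square array and one source 1 m in front of it
mics = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [0.1, 0.1]]
x_time = simulate_array_signals(mics, src_xyz=(0.05, 0.02, 1.0))
csm = cross_spectral_matrix(x_time)
print("time-domain input shape:", x_time.shape)
print("CSM input shape:", csm.shape)
```

In a time-domain approach of the kind the abstract describes, an array like `x_time` would be passed to the network directly, whereas a CSM-based approach would first reduce it to `csm` (or a single frequency slice of it), discarding the block-to-block time structure in the averaging step.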
Pages: 10
Related papers
50 in total
  • [31] A deep reinforcement learning based searching method for source localization
    Zhao, Yong
    Chen, Bin
    Wang, XiangHan
    Zhu, Zhengqiu
    Wang, Yiduo
    Cheng, Guangquan
    Wang, Rui
    Wang, Rongxiao
    He, Ming
    Liu, Yu
    INFORMATION SCIENCES, 2022, 588 : 67 - 81
  • [32] A deep reinforcement learning based searching method for source localization
    College of Systems Engineering, National University of Defense Technology, 109 Deya Road, Kaifu District, Changsha City
    Hunan Province, China
    Unknown
    Unknown
    Inf Sci, 2022, (67-81): : 67 - 81
  • [33] Pulse Wave Signal Feature Recognition Based on Time-domain Differential Period Ratio
    Fan Bao-cun
    Wang Yan
    Huang Chen-chen
    Ge Zi-yang
    Jin Ping
    ACTA PHOTONICA SINICA, 2020, 49 (12)
  • [34] Acoustic holography method for measuring moving sound source with correction for Doppler effect in time-domain
    Yang Dian-Ge
    Luo Yu-Gong
    Li Bing
    Li Ke-Qiang
    Lian Xiao-Min
    ACTA PHYSICA SINICA, 2010, 59 (07) : 4738 - 4747
  • [35] A Deep Learning Inversion Method for Airborne Time-Domain Electromagnetic Data Using Convolutional Neural Network
    Yu, Xiaodong
    Zhang, Peng
    Yu, Xi
    APPLIED SCIENCES-BASEL, 2024, 14 (19):
  • [36] A time-domain inverse technique for the localization and quantification of rotating sound sources
    Zhang, Xiao-Zheng
    Bi, Chuan-Xing
    Zhang, Yong-Bin
    Xu, Liang
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2017, 90 : 15 - 29
  • [37] Sound Source Localization for HRI Using FOC-Based Time Difference Feature and Spatial Grid Matching
    Li, Xiaofei
    Liu, Hong
    IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (04) : 1199 - 1212
  • [38] Deep Learning-Based Indoor Localization Using Adjacent Received Signal Strength and Domain Knowledge
    Zhang, Guangyi
    Hou, Zhanwei
    Li, Yonghui
    Vucetic, Branka
    2022 20TH MEDITERRANEAN COMMUNICATION AND COMPUTER NETWORKING CONFERENCE (MEDCOMNET), 2022,
  • [39] Deep Learning-Based Indoor Localization Using Adjacent Received Signal Strength and Domain Knowledge
    Zhang, Guangyi
    Hou, Zhanwei
    Li, Yonghui
    Vucetic, Branka
    2022 20th Mediterranean Communication and Computer Networking Conference, MedComNet 2022, 2022, : 25 - 30
  • [40] Acoustic Source Localization in the Circular Harmonic Domain Using Deep Learning Architecture
    SongGong, Kunkun
    Wang, Wenwu
    Chen, Huawei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2475 - 2491