Sound source localization using deep learning models

Cited by: 72
Authors
Yalta N. [1]
Nakadai K. [2]
Ogata T. [1, 3]
Affiliations
[1] Intermedia Art and Science Department, Waseda University, 3-4-1 Ohkubo, Shinjuku, 169-8555, Tokyo
[2] Honda Research Institute Japan Co., Ltd, Tokyo Institute of Technology, 8-1 Honcho, Wako, 351-0188, Saitama
[3] Faculty of Science and Engineering, Waseda University, 3-4-1 Ohkubo, Shinjuku, 169-8555, Tokyo
Keywords
Deep learning; Deep residual networks; Sound source localization
DOI
10.20965/jrm.2017.p0037
Abstract
This study proposes the use of a deep neural network to localize a sound source using an array of microphones in a reverberant environment. During the last few years, applications based on deep neural networks have performed various tasks, such as image classification and speech recognition, at levels that exceed even human capabilities. In our study, we employ deep residual networks, which have recently shown remarkable performance in image classification tasks even when the training period is shorter than that of other models. Deep residual networks are used to process audio input similar to that used by multiple signal classification (MUSIC) methods. We show that with end-to-end training and generic preprocessing, the performance of deep residual networks not only surpasses the block-level accuracy of linear models in nearly clean environments but also shows robustness to challenging conditions by exploiting the time delay on power information. © 2017, Fuji Technology Press. All rights reserved.
Pages: 37-48
Page count: 11
Related papers
  • [1] A survey of sound source localization with deep learning methods
    Grumiaux, Pierre-Amaury
    Kitic, Srdan
    Girin, Laurent
    Guerin, Alexandre
    Journal of the Acoustical Society of America, 2022, 152(1): 107-151
  • [2] Application of deep learning for accurate source localization using sound intensity vector
    Jeong, Iljoo
    Jung, In-Jee
    Lee, Seungchul
    Journal of the Acoustical Society of Korea, 2024, 43(1): 72-77
  • [3] SSLIDE: Sound source localization for indoors based on deep learning
    Wu, Yifan
    Ayyalasomayajula, Roshan
    Bianco, Michael J.
    Bharadia, Dinesh
    Gerstoft, Peter
    2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), 2021: 4680-4684
  • [4] Phased microphone array for sound source localization with deep learning
    Ma W.
    Liu X.
    Aerospace Systems, 2019, 2(2): 71-81
  • [5] Noise source localization using deep learning
    Zhou, Jie
    Mi, Binbin
    Xia, Jianghai
    Zhang, Hao
    Liu, Ya
    Chen, Xinhua
    Guan, Bo
    Hong, Yu
    Ma, Yulong
    Geophysical Journal International, 2024, 238(1): 513-536
  • [6] QuadCOINS-Network: A Deep Learning Approach to Sound Source Localization
    Ciccia, Simone
    Scionti, Alberto
    Vitali, Giacomo
    Terzo, Olivier
    Complex, Intelligent and Software Intensive Systems, 2021, 1194: 130-141
  • [7] Deep Learning-Based Dereverberation for Sound Source Localization with Beamforming
    Zhai, Qingbo
    Ning, Fangli
    Hou, Hongjie
    Wei, Juan
    Su, Zhaojing
    Journal of Theoretical and Computational Acoustics, 2024, 32(1)
  • [8] Deep Learning Aided Sound Source Localization: A Nonsynchronous Measurement Approach
    Chen, Guitong
    Chen, Long
    Sun, Weize
    Li, Qiang
    Huang, Lei
    IEEE Transactions on Instrumentation and Measurement, 2023, 72
  • [9] Sound source localization method based time-domain signal feature using deep learning
    Tang, Jun
    Sun, Xinmiao
    Yan, Lei
    Qu, Yang
    Wang, Tao
    Yue, Yuan
    Applied Acoustics, 2023, 213
  • [10] Deep-Learning-Assisted Sound Source Localization From a Flying Drone
    Wang, Lin
    Cavallaro, Andrea
    IEEE Sensors Journal, 2022, 22(21): 20828-20838