Sound source localization using deep learning models

被引:72
|
作者
Yalta N. [1 ]
Nakadai K. [2 ]
Ogata T. [1 ,3 ]
机构
[1] Intermedia Art and Science Department, Waseda University, 3-4-1 Ohkubo, Shinjuku, 169-8555, Tokyo
[2] Honda Research Institute Japan Co., Ltd, Tokyo Institute of Technology, 8-1 Honcho, Wako, 351-0188, Saitama
[3] Faculty of Science and Engineering, Waseda University, 3-4-1 Ohkubo, Shinjuku, 169-8555, Tokyo
关键词
Deep learning; Deep residual networks; Sound source localization;
D O I
10.20965/jrm.2017.p0037
中图分类号
学科分类号
摘要
This study proposes the use of a deep neural network to localize a sound source using an array of microphones in a reverberant environment. During the last few years, applications based on deep neural networks have performed various tasks such as image classification or speech recognition to levels that exceed even human capabilities. In our study, we employ deep residual networks, which have recently shown remarkable performance in image classification tasks even when the training period is shorter than that of other models. Deep residual networks are used to process audio input similar to multiple signal classification (MUSIC) methods. We show that with end-to-end training and generic preprocessing, the performance of deep residual networks not only surpasses the block level accuracy of linear models on nearly clean environments but also shows robustness to challenging conditions by exploiting the time delay on power information. © 2017, Fuji Technology Press. All rights reserved.
引用
收藏
页码:37 / 48
页数:11
相关论文
共 50 条
  • [41] Sound Source Localization Using Piezoelectric Acoustic Metasurfaces
    Jin-Cheng Gu
    Wei Lin
    Cai-Xia Kan
    Acoustics Australia, 2020, 48 : 455 - 461
  • [42] Sound Source Localization Using Sparse Coding and SOM
    Kim, Hong-shik
    Choi, Jong-suk
    2009 IEEE CONFERENCE ON EMERGING TECHNOLOGIES & FACTORY AUTOMATION (EFTA 2009), 2009,
  • [43] Source localization by matching sound intensity with a vertical array in the deep ocean
    Liu, Wenxu
    Yang, Yixin
    Lu, Liangang
    Shi, Yang
    Liu, Zongwei
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (06): : EL477 - EL481
  • [44] Sound Source Localization Using Piezoelectric Acoustic Metasurfaces
    Gu, Jin-Cheng
    Lin, Wei
    Kan, Cai-Xia
    ACOUSTICS AUSTRALIA, 2020, 48 (03) : 455 - 461
  • [45] Deep learning-based approach to improve the accuracy of time difference of arrival - based sound source localization
    Jeong, Iljoo
    Huh, Hyunsuk
    Jung, In-Jee
    Lee, Seungchul
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2024, 43 (02): : 178 - 183
  • [46] SPATIAL FEATURE LEARNING FOR ROBUST BINAURAL SOUND SOURCE LOCALIZATION USING A COMPOSITE FEATURE VECTOR
    Wu, Xiang
    Talagala, Dumidu S.
    Zhang, Wen
    Abhayapala, Thushara D.
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6320 - 6324
  • [47] Deep-learning source localization using autocorrelation functions from a single hydrophone in deep ocean
    Liu, Yining
    Niu, Haiqiang
    Li, Zhenglin
    Wang, Mengyuan
    JASA EXPRESS LETTERS, 2021, 1 (03):
  • [48] DISCRIMINATIVE MULTIPLE SOUND SOURCE LOCALIZATION BASED ON DEEP NEURAL NETWORKS USING INDEPENDENT LOCATION MODEL
    Takeda, Ryu
    Komatani, Kazunori
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 603 - 609
  • [49] Heart Sound Classification Using Wavelet Analysis Approaches and Ensemble of Deep Learning Models
    Lee, Jin-A
    Kwak, Keun-Chang
    APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [50] Multiple source localization using learning-based sparse estimation in deep ocean
    Liu, Yining
    Niu, Haiqiang
    Yang, Sisi
    Li, Zhenglin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 150 (05): : 3773 - 3786