Sound source localization using deep learning models

被引：72

作者：

Yalta N. ^{[1
]}

Nakadai K. ^{[2
]}

Ogata T. ^{[1
,3
]}

机构：

[1] Intermedia Art and Science Department, Waseda University, 3-4-1 Ohkubo, Shinjuku, 169-8555, Tokyo

[2] Honda Research Institute Japan Co., Ltd, Tokyo Institute of Technology, 8-1 Honcho, Wako, 351-0188, Saitama

[3] Faculty of Science and Engineering, Waseda University, 3-4-1 Ohkubo, Shinjuku, 169-8555, Tokyo

来源：

| 2017年 / Fuji Technology Press卷 / 29期

关键词：

Deep learning; Deep residual networks; Sound source localization;

D O I：

10.20965/jrm.2017.p0037

中图分类号：

学科分类号：

摘要：

This study proposes the use of a deep neural network to localize a sound source using an array of microphones in a reverberant environment. During the last few years, applications based on deep neural networks have performed various tasks such as image classification or speech recognition to levels that exceed even human capabilities. In our study, we employ deep residual networks, which have recently shown remarkable performance in image classification tasks even when the training period is shorter than that of other models. Deep residual networks are used to process audio input similar to multiple signal classification (MUSIC) methods. We show that with end-to-end training and generic preprocessing, the performance of deep residual networks not only surpasses the block level accuracy of linear models on nearly clean environments but also shows robustness to challenging conditions by exploiting the time delay on power information. © 2017, Fuji Technology Press. All rights reserved.

引用

页码：37 / 48

页数：11

共 50 条

[41] Sound Source Localization Using Piezoelectric Acoustic Metasurfaces
Jin-Cheng Gu
Wei Lin
Cai-Xia Kan
Acoustics Australia, 2020, 48 : 455 - 461
[42] Sound Source Localization Using Sparse Coding and SOM
Kim, Hong-shik
Choi, Jong-suk
2009 IEEE CONFERENCE ON EMERGING TECHNOLOGIES & FACTORY AUTOMATION (EFTA 2009), 2009,
[43] Source localization by matching sound intensity with a vertical array in the deep ocean
Liu, Wenxu
Yang, Yixin
Lu, Liangang
Shi, Yang
Liu, Zongwei
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (06): : EL477 - EL481
[44] Sound Source Localization Using Piezoelectric Acoustic Metasurfaces
Gu, Jin-Cheng
Lin, Wei
Kan, Cai-Xia
ACOUSTICS AUSTRALIA, 2020, 48 (03) : 455 - 461
[45] Deep learning-based approach to improve the accuracy of time difference of arrival - based sound source localization
Jeong, Iljoo
Huh, Hyunsuk
Jung, In-Jee
Lee, Seungchul
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2024, 43 (02): : 178 - 183
[46] SPATIAL FEATURE LEARNING FOR ROBUST BINAURAL SOUND SOURCE LOCALIZATION USING A COMPOSITE FEATURE VECTOR
Wu, Xiang
Talagala, Dumidu S.
Zhang, Wen
Abhayapala, Thushara D.
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6320 - 6324
[47] Deep-learning source localization using autocorrelation functions from a single hydrophone in deep ocean
Liu, Yining
Niu, Haiqiang
Li, Zhenglin
Wang, Mengyuan
JASA EXPRESS LETTERS, 2021, 1 (03):
[48] DISCRIMINATIVE MULTIPLE SOUND SOURCE LOCALIZATION BASED ON DEEP NEURAL NETWORKS USING INDEPENDENT LOCATION MODEL
Takeda, Ryu
Komatani, Kazunori
2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 603 - 609
[49] Heart Sound Classification Using Wavelet Analysis Approaches and Ensemble of Deep Learning Models
Lee, Jin-A
Kwak, Keun-Chang
APPLIED SCIENCES-BASEL, 2023, 13 (21):
[50] Multiple source localization using learning-based sparse estimation in deep ocean
Liu, Yining
Niu, Haiqiang
Yang, Sisi
Li, Zhenglin
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 150 (05): : 3773 - 3786

← 1 2 3 4 5 →