Binaural Sound Source Distance Estimation and Localization for a Moving Listener

被引:4
|
作者
Krause, Daniel Aleksander [1 ]
Garcia-Barrios, Guillermo [2 ]
Politis, Archontis [1 ]
Mesaros, Annamaria [1 ]
机构
[1] Tampere Univ, Comp Sci, Tampere 33720, Finland
[2] Univ Politecn Madrid, Grp Acoust & MultiMedia Applicat, Madrid 33014, Spain
基金
芬兰科学院;
关键词
Sound source localization; sound distance estimation; binaural audio; DEEP NEURAL-NETWORKS; POSITION ESTIMATION; HEAD MOVEMENTS; ROBUST; MODEL;
D O I
10.1109/TASLP.2023.3346297
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we investigate the tasks of binaural source distance estimation (SDE) and direction-of-arrival estimation (DOAE) using motion-based cues in a scenario with a walking listener. On top of performing both tasks as separate problems, we study two methods of solving the joint task of simultaneous source distance estimation and localization (SDEL), with a single model. Experiments are conducted for three different scenarios: a static receiver; a static receiver with a rotating head; and a freely moving listener inside a room. The study proposes rotation and translation features to include information about the receiver's motion during model training and studies the effects of these on the final performance. The work includes extended simulation of three datasets containing numerous testing scenarios for sound sources, covering a wide range of DOAs and a source-to-receiver distance up to 15 m. Results are further analyzed with respect to room reverberation, walking speed, as well as source-to-receiver distance. The presented outcomes show large improvements in both DOA and distance estimation for a model that uses motion-based cues as compared with a static scenario. These include a decrease of 9.50(degrees) in DOA and 1.56 m in distance errors for a joint model, followed by 16.17(degrees) and 0.17 m for separate models.
引用
收藏
页码:996 / 1011
页数:16
相关论文
共 50 条
  • [1] A study on distance estimation in binaural sound localization
    Rodemann, Tobias
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 425 - 430
  • [2] Binaural Estimation of Sound Source Distance via the Direct-to-Reverberant Energy Ratio for Static and Moving Sources
    Lu, Yan-Chen
    Cooke, Martin
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1793 - 1805
  • [3] Binaural localization for a mobile sound source
    Kumon M.
    Uozumi S.
    Journal of Biomechanical Science and Engineering, 2011, 6 (01): : 26 - 39
  • [4] Sound Source Distance Estimation in Rooms based on Statistical Properties of Binaural Signals
    Georganti, Eleftheria
    May, Tobias
    van de Par, Steven
    Mourjopoulos, John
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (08): : 1727 - 1741
  • [5] Binaural Sound Source Distance Learning in Rooms
    Vesa, Sampo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (08): : 1498 - 1507
  • [6] Full Sound Source Localization of Binaural Signals
    Venkatesan, R.
    Ganesh, A. Balaji
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 213 - 217
  • [7] The simulation of binaural hearing caused by a moving sound source
    Lee, P. L.
    Wang, J. H.
    COMPUTERS & STRUCTURES, 2009, 87 (17-18) : 1102 - 1110
  • [8] Sound source distance learning based on binaural signals
    Vesa, Sampo
    2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 9 - 12
  • [9] Localization of sound source direction using the binaural model
    Department of Production Systems Engineering, Toyohashi University of Technology, 1-1 Hibarigaoka, Tempaku-cho, Toyohashi-shi, Aichi, 441-8580, Japan
    Nihon Kikai Gakkai Ronbunshu C, 2008, 3 (642-649):
  • [10] Binaural Sound Source Localization in Real and Virtual Rooms
    Rychtarikova, Monika
    Van den Bogaert, Tim
    Vermeir, Gerrit
    Wouters, Jan
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2009, 57 (04): : 205 - 220