Binaural Sound Source Distance Estimation and Localization for a Moving Listener

Cited by: 4
Authors
Krause, Daniel Aleksander [1]
Garcia-Barrios, Guillermo [2]
Politis, Archontis [1]
Mesaros, Annamaria [1]
Affiliations
[1] Tampere Univ, Comp Sci, Tampere 33720, Finland
[2] Univ Politecn Madrid, Grp Acoust & MultiMedia Applicat, Madrid 33014, Spain
Funding
Academy of Finland
Keywords
sound source localization; sound distance estimation; binaural audio; deep neural networks; position estimation; head movements; robust; model
DOI
10.1109/TASLP.2023.3346297
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
In this paper, we investigate the tasks of binaural source distance estimation (SDE) and direction-of-arrival estimation (DOAE) using motion-based cues in a scenario with a walking listener. In addition to treating the two tasks as separate problems, we study two methods of solving the joint task of simultaneous source distance estimation and localization (SDEL) with a single model. Experiments are conducted for three different scenarios: a static receiver; a static receiver with a rotating head; and a freely moving listener inside a room. The study proposes rotation and translation features that include information about the receiver's motion during model training, and studies their effect on the final performance. The work includes extended simulation of three datasets containing numerous testing scenarios for sound sources, covering a wide range of DOAs and source-to-receiver distances of up to 15 m. Results are further analyzed with respect to room reverberation, walking speed, and source-to-receiver distance. The presented outcomes show large improvements in both DOA and distance estimation for a model that uses motion-based cues, compared with a static scenario. These include a decrease of 9.50° in DOA error and 1.56 m in distance error for a joint model, followed by 16.17° and 0.17 m for separate models.
Pages: 996-1011
Number of pages: 16