Binaural Sound Source Distance Estimation and Localization for a Moving Listener

Cited by: 4
Authors
Krause, Daniel Aleksander [1]
Garcia-Barrios, Guillermo [2]
Politis, Archontis [1]
Mesaros, Annamaria [1]
Affiliations
[1] Tampere Univ, Comp Sci, Tampere 33720, Finland
[2] Univ Politecn Madrid, Grp Acoust & MultiMedia Applicat, Madrid 33014, Spain
Funding
Academy of Finland
Keywords
Sound source localization; sound distance estimation; binaural audio; deep neural networks; position estimation; head movements; robust; model
DOI
10.1109/TASLP.2023.3346297
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
In this paper, we investigate the tasks of binaural source distance estimation (SDE) and direction-of-arrival estimation (DOAE) using motion-based cues in a scenario with a walking listener. In addition to treating the two tasks as separate problems, we study two methods of solving the joint task of simultaneous source distance estimation and localization (SDEL) with a single model. Experiments are conducted for three scenarios: a static receiver; a static receiver with a rotating head; and a listener moving freely inside a room. The study proposes rotation and translation features that provide information about the receiver's motion during model training, and examines their effect on the final performance. The work includes extended simulation of three datasets containing numerous testing scenarios for sound sources, covering a wide range of DOAs and source-to-receiver distances of up to 15 m. Results are further analyzed with respect to room reverberation, walking speed, and source-to-receiver distance. The presented outcomes show large improvements in both DOA and distance estimation for a model that uses motion-based cues, compared with a static scenario. These include a decrease of 9.50° in DOA error and 1.56 m in distance error for a joint model, followed by 16.17° and 0.17 m for separate models.
Pages: 996-1011
Number of pages: 16