Cepstrum Prefiltering for Binaural Source Localization in Reverberant Environments

Cited by: 18
Authors
Parisi, Raffaele [1]
Camoes, Flavia [1]
Scarpiniti, Michele [1]
Uncini, Aurelio [1]
Affiliations
[1] Univ Roma La Sapienza, Dept Informat Elect & Telecommun DIET, I-00184 Rome, Italy
Keywords
Binaural sound localization; cepstral filtering; reverberation
DOI
10.1109/LSP.2011.2180376
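Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract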
Binaural sound source localization can be performed by imitation of the fundamental mechanisms of the human auditory system, which is based on the integrated effects of ear, pinnae, head and torso. In particular, two physical cues can be exploited, i.e. the Interaural Time Difference (ITD) and the Interaural Level Difference (ILD). It is known that joint use of ITD and ILD provides good source azimuth estimations [1]. In many practical situations binaural localization has to be performed in closed environments, where the presence of reverberation degrades the performance of available position estimators. In this paper a possible solution to this difficult problem is introduced. The proposed solution is based on proper use of cepstral prefiltering prior to source localization by ITD and ILD. It is shown that cepstrum can help in reducing the effects of reverberation, thus yielding better location estimates.
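As a rough illustration of the two cues named in the abstract, the sketch below estimates an ITD from the peak of the cross-correlation between the two ear signals and an ILD from their energy ratio in dB. It is not the authors' method: the cepstral prefiltering stage is omitted because the abstract does not specify it, and the function name `estimate_itd_ild`, its parameters, and the sign convention are assumptions made for this example.

```python
import numpy as np

def estimate_itd_ild(left, right, fs):
    """Hypothetical helper: return (itd_seconds, ild_db) for two ear signals.

    Assumed sign convention: a positive ITD means the left signal lags the
    right one; a positive ILD means the left signal carries more energy.
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)

    # Full cross-correlation; output index 0 corresponds to lag -(len(right) - 1).
    xcorr = np.correlate(left, right, mode="full")
    lag = int(np.argmax(xcorr)) - (len(right) - 1)
    itd = lag / fs

    # Level difference as an energy ratio in dB; eps guards against log(0).
    eps = 1e-12
    ild = 10.0 * np.log10((np.sum(left**2) + eps) / (np.sum(right**2) + eps))
    return itd, ild
```

In a reverberant room the cross-correlation peak is smeared by reflections, which is the degradation the abstract says prefiltering is meant to reduce; any such stage would run on `left` and `right` before this function is called.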
Pages: 99-102
Page count: 4
Related papers
50 in total
  • [21] Efficient source localization and tracking in reverberant environments using microphone arrays
    Antonacci, F
    Lonoce, D
    Motta, M
    Sarti, A
    Tubaro, S
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1061 - 1064
  • [22] A hybrid algorithm for robust acoustic source localization in noisy and reverberant environments
    Rajagopalan, Ramesh
    Dessonville, Timothy
    [J]. REMOTE SENSING SYSTEM ENGINEERING V, 2014, 9223
  • [23] Robust Source Localization in Reverberant Environments Based on Weighted Fuzzy Clustering
    Kuehne, Marco
    Togneri, Roberto
    Nordholm, Sven
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (1-3) : 85 - 88
  • [24] Sound source localization in reverberant environments using an outlier elimination algorithm
    Jan, EE
    Flanagan, J
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1321 - 1324
  • [25] Sound Source Localization Based on Robust Least Squares in Reverberant Environments
    Zhu, Hongyan
    Dang, Xudong
    Li, Zelin
    Ge, Quanbo
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 2029 - 2035
  • [26] Acoustic Source Localization Based on Geometric Projection in Reverberant and Noisy Environments
    Long, Tao
    Chen, Jingdong
    Huang, Gongping
    Benesty, Jacob
    Cohen, Israel
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) : 143 - 155
  • [27] ON THE ROLE OF LOCALIZATION CUES IN BINAURAL SEGREGATION OF REVERBERANT SPEECH
    Woodruff, John
    Wang, DeLiang
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009, : 2205 - +
  • [28] Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments
    Ma, Ning
    May, Tobias
    Brown, Guy J.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (12) : 2444 - 2453
  • [29] Exploiting top-down source models to improve binaural localisation of multiple sources in reverberant environments
    Ma, Ning
    Brown, Guy J.
    Gonzalez, Jose A.
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 160 - 164
  • [30] A robust dual-microphone speech source localization algorithm for reverberant environments
    Guo, Yanmeng
    Wang, Xiaofei
    Wu, Chao
    Fu, Qiang
    Mai, Ning
    Brown, Guy J.
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3354 - 3358