Performance of phase transform for detecting sound sources with microphone arrays in reverberant and noisy environments

被引:29
|
作者
Donohue, Kevin D. [1 ]
Hannemann, Jens [1 ]
Dietz, Henry G. [1 ]
机构
[1] Univ Kentucky, Ctr Visualizat & Virtual Environm, Lexington, KY 40507 USA
基金
美国国家科学基金会;
关键词
phase transform; microphone array;
D O I
10.1016/j.sigpro.2007.01.013
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The performance of sound source location (SSL) algorithms with microphone arrays can be enhanced by processing signals prior to the delay and sum operation. The phase transform (PHAT) has been shown to improve SSL images, especially in reverberant environments. This paper introduces a modification, referred to as the PHAT-beta transform, that varies the degree of spectral magnitude information used by the transform through a single parameter. Performance results are computed using a Monte Carlo simulation of an eight element perimeter array with a receiver operating characteristic (ROC) analysis for detecting single and multiple sound sources. In addition, a Fisher's criterion performance measure is also computed for target and noise peak separability and compared to the ROC results. Results show that the standard PHAT significantly improves detection performance for broadband signals especially in high levels of reverberation noise, and to a lesser degree for noise from other coherent sources. For narrowband targets the PHAT typically results in significant performance degradation; however, the PHAT-beta can achieve performance improvements for both narrowband and broadband signals. Finally, the performance for real speech signal samples is examined and shown to exhibit properties similar to both the simulated broad and narrowband cases, suggesting the use of beta values between 0.5 and 0.7 for array applications with general signals. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1677 / 1691
页数:15
相关论文
共 50 条
  • [41] Improving the Localization Accuracy of Dipole Sound Sources Using Planar Microphone Arrays
    V. F. Kopiev
    V. V. Ershov
    I. V. Khramtsov
    O. Yu. Kustov
    Acoustical Physics, 2023, 69 : 206 - 219
  • [42] Combined LCMV-TRINICON Beamforming for Separating Multiple Speech Sources in Noisy and Reverberant Environments
    Markovich-Golan, Shmulik
    Gannot, Sharon
    Kellermann, Walter
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (02) : 320 - 332
  • [43] Improving the Localization Accuracy of Dipole Sound Sources Using Planar Microphone Arrays
    Kopiev, V. F.
    Ershov, V. V.
    Khramtsov, I. V.
    Kustov, O. Yu.
    ACOUSTICAL PHYSICS, 2023, 69 (02) : 206 - 219
  • [44] Reconstructing the normal velocities of acoustic sources in noisy environments using a rigid microphone array
    Xiang, Shang
    Jiang, Weikang
    Jiang, Hao
    Gao, Jianzheng
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (03): : 2082 - 2090
  • [45] AUDIO SIGNAL CLASSIFICATION IN REVERBERANT ENVIRONMENTS BASED ON FUZZY-CLUSTERED AD-HOC MICROPHONE ARRAYS
    Gergen, Sebastian
    Nagathil, Anil
    Martin, Rainer
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3692 - 3696
  • [46] Integration of Multiple Microphone Arrays and Use of Sound Reflections for 3D Localization of Sound Sources
    Ishi, Carlos T.
    Even, Jani
    Hagita, Norihiro
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2014, E97A (09) : 1867 - 1874
  • [47] Robust tracking of multiple sound sources by spatial integration of room and robot microphone arrays
    Nakadai, Kazuhiro
    Nakajima, Hirofumi
    Murase, Masamitsu
    Kaiiiri, Satoshi
    Yamada, Kentaro
    Nakamura, Takahiro
    Hasegawa, Yuji
    Okuno, Hiroshi G.
    Tsujino, Hiroshi
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 4599 - 4602
  • [48] Using multiple microphone arrays and reflections for 3D localization of sound sources
    Ishi, Carlos T.
    Even, Jani
    Hagita, Norihiro
    2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 3937 - 3942
  • [49] Sparse Iterative Beamforming Using Spherical Microphone Arrays for Low-Latency Direction of Arrival Estimation in Reverberant Environments
    Mathews, Jonathan
    Braasch, Jonas
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2021, 69 (12): : 967 - 977
  • [50] Estimation of speech recognition performance in noisy and reverberant environments using PESQ score and acoustic parameters
    Fukumori, Takahiro
    Nakayama, Masato
    Nishiura, Takanobu
    Yamashita, Yoichi
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,