Evaluation of localization precision by proposed quasi-spherical nested microphone array in combination with multiresolution adaptive steered response power

被引:0
|
作者
Firoozabadi, Ali Dehghan [1 ]
Irarrazaval, Pablo [2 ,3 ,4 ]
Adasme, Pablo [5 ]
Zabala-Blanco, David [6 ]
Azurdia-Meza, Cesar [7 ]
机构
[1] Univ Tecnol Metropolitana, Dept Elect, Av Jose Pedro Alessandri 1242, Santiago 7800002, Chile
[2] Pontificia Univ Catolica Chile, Elect Engn Dept, Santiago 7820436, Chile
[3] Pontificia Univ Catolica Chile, Biomed Imaging Ctr, Santiago 7820436, Chile
[4] Pontificia Univ Catolica Chile, Inst Biol & Med Engn, Santiago 7820436, Chile
[5] Univ Santiago Chile, Elect Engn Dept, Av Ecuador 3519, Santiago 9170124, Chile
[6] Univ Catolica Maule, Dept Comp & Ind, Talca 3466706, Chile
[7] Univ Chile, Dept Elect Engn, Santiago 8370451, Chile
关键词
sound source localization; nested microphone array; subband processing; time delay estimation; filter bank; SPEECH-SOURCE; SOUND SOURCE; ROBUST;
D O I
10.2478/jee-2020-0022
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multiple sound source localization in noisy and reverberant conditions is one of the important challenges in the speech signal processing. The aim of this article is three-dimensional sound source localization in undesirable scenarios. For the localization algorithms, the spatial aliasing is one of the destructive factors in reducing the accuracy. Firstly, a 3D quasi-spherical nested microphone array (QSNMA) is proposed for eliminating the spatial aliasing. Since the speech signal has the windowed-disjoint orthogonality property, the speech information differs in terms of the frequency bands. Then, the Gammatone filter bank is introduced for the speech subband processing. In the following, the multiresolution steered response power (SRP) algorithm is adaptively implemented on subbands with the phase transform (PHAT)/maximum likelihood (ML) weighted functions based on the levels of the noise and reverberation. The peaks of the multiresolution adaptive SRP (MASRP) algorithm are extracted in each subband based on the number of speakers for continuous time frames. Finally, the distribution of these peaks are calculated in each subband and they are merged by the use of weighted averaging method. The final 3D speakers locations are estimated by extracting the peaks in the final distribution. The proposed QSNMA-MASRP(PHAT/ML) algorithm is evaluated on real and simulated data for 2 and 3 simultaneous speakers in noisy and reverberant conditions. The proposed method is compared with SRP-PHAT, spectral source model-deep neural network, and spherical harmonic temporal extension of multiple response model sparse Bayesian learning algorithms on different range of signal-to-noise ratio and reverberation time. The mean absolute estimation error, averaged standard deviation for absolute estimation error, and computational complexity results show the superiority of the proposed method.
引用
收藏
页码:150 / 164
页数:15
相关论文
共 4 条
  • [1] A Novel Quasi-Spherical Nested Microphone Array and Multiresolution Modified SRP by GammaTone Filterbank for Multiple Speakers Localization
    Firoozabadi, Ali Dehghan
    Irarrazaval, Pablo
    Adasme, Pablo
    Durney, Hugo
    Olave, Miguel Sanhueza
    2019 SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA 2019), 2019, : 208 - 213
  • [2] Multiresolution Speech Enhancement Based on Proposed Circular Nested Microphone Array in Combination with Sub-Band Affine Projection Algorithm
    Firoozabadi, Ali Dehghan
    Irarrazaval, Pablo
    Adasme, Pablo
    Zabala-Blanco, David
    Durney, Hugo
    Sanhueza, Miguel
    Palacios-Jativa, Pablo
    Azurdia-Meza, Cesar
    APPLIED SCIENCES-BASEL, 2020, 10 (11):
  • [3] 3D Multiple Sound Source Localization by Proposed Cuboids Nested Microphone Array in Combination with Adaptive Wavelet-Based Subband GEVD
    Dehghan Firoozabadi, Ali
    Irarrazaval, Pablo
    Adasme, Pablo
    Zabala-Blanco, David
    Palacios-Jativa, Pablo
    Azurdia-Meza, Cesar
    ELECTRONICS, 2020, 9 (05)
  • [4] A FAST SEARCH METHOD OF STEERED RESPONSE POWER WITH SMALL-APERTURE MICROPHONE ARRAY FOR SOUND SOURCE LOCALIZATION
    Zhao Xiaoyan
    Tang Jie
    Zhou Lin
    Wu Zhenyang
    JournalofElectronics(China), 2013, 30 (05) : 483 - 490