Speakers counting by proposed nested microphone array in combination with limited space SRP

被引:1
|
作者
Dehghan Firoozabadi, Ali [1 ]
Irarrazaval, Pablo [2 ]
Adasme, Pablo [3 ]
Zabala-Blanco, David [4 ]
Palacios-Jativa, Pablo [5 ]
Durney, Hugo [1 ]
Sanhueza, Miguel [1 ]
Azurdia-Meza, Cesar [5 ]
机构
[1] Univ Tecnol Metropolitana, Dept Elect, Av Jose Pedro Alessandri 1242, Santiago 7800002, Chile
[2] Pontificia Univ Catolica Chile, Elect Engn Dept, Santiago, Chile
[3] Univ Santiago Chile, Elect Engn Dept, Av Ecuador 3519, Santiago 9170124, Chile
[4] Univ Catolica Maule, Ctr Invest Estudios Avanzados Maule CIEAM, Vicerrectoria Invest & Postgrad, Talca 3466706, Chile
[5] Univ Chile, Dept Elect Engn, Santiago 8370451, Chile
关键词
Speakers counting; nested microphone array; subband processing; classification; filtering; SIGNALS; NUMBER;
D O I
10.23919/EUSIPCO54536.2021.9616309
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a novel method is presented for estimating the number of speakers based on the microphone arrays. Firstly, a 3D snowflake nested microphone array (SNMA) is proposed for recording the speech signals. In the following, the steered response power (SRP) algorithm is implemented on subbands in limited spaces conditions for all microphone pairs related to the subarrays. Therefore, a weighted averaging method is implemented on subband limited spaces SRPs (LSRP), and the final energy map is compared with the histogram of the maximums of the SRP function on different subbands for various time frames. The passed candidate points are categorized by unsupervised K-means clustering and the number of speakers is estimated by the silhouette criteria. The accuracy of the proposed method is compared with PENS, i-vector PLDA, and wavelet-GEVD algorithms. The results show the superiority of the proposed method in comparison with other previous research.
引用
收藏
页码:271 / 275
页数:5
相关论文
共 9 条
  • [1] A Novel Quasi-Spherical Nested Microphone Array and Multiresolution Modified SRP by GammaTone Filterbank for Multiple Speakers Localization
    Firoozabadi, Ali Dehghan
    Irarrazaval, Pablo
    Adasme, Pablo
    Durney, Hugo
    Olave, Miguel Sanhueza
    2019 SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA 2019), 2019, : 208 - 213
  • [2] A Novel Nested Circular Microphone Array and Subband Processing-Based System for Counting and DOA Estimation of Multiple Simultaneous Speakers
    Ali Dehghan Firoozabadi
    Hamid Reza Abutalebi
    Circuits, Systems, and Signal Processing, 2016, 35 : 573 - 601
  • [3] A Novel Nested Circular Microphone Array and Subband Processing-Based System for Counting and DOA Estimation of Multiple Simultaneous Speakers
    Firoozabadi, Ali Dehghan
    Abutalebi, Hamid Reza
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2016, 35 (02) : 573 - 601
  • [4] 3D Localization of Multiple Simultaneous Speakers with Discrete Wavelet Transform and Proposed 3D Nested Microphone Array
    Firoozabadi, Ali Dehghan
    Durney, Hugo
    Soto, Ismael
    Olave, Miguel Sanhueza
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 356 - 360
  • [5] Multiresolution Speech Enhancement Based on Proposed Circular Nested Microphone Array in Combination with Sub-Band Affine Projection Algorithm
    Firoozabadi, Ali Dehghan
    Irarrazaval, Pablo
    Adasme, Pablo
    Zabala-Blanco, David
    Durney, Hugo
    Sanhueza, Miguel
    Palacios-Jativa, Pablo
    Azurdia-Meza, Cesar
    APPLIED SCIENCES-BASEL, 2020, 10 (11):
  • [6] Evaluation of localization precision by proposed quasi-spherical nested microphone array in combination with multiresolution adaptive steered response power
    Firoozabadi, Ali Dehghan
    Irarrazaval, Pablo
    Adasme, Pablo
    Zabala-Blanco, David
    Azurdia-Meza, Cesar
    JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2020, 71 (03): : 150 - 164
  • [7] Speaker Counting Based on a Novel Hive Shaped Nested Microphone Array by WPT and 2D Adaptive SRP Algorithms in Near-Field Scenarios
    Firoozabadi, Ali Dehghan
    Adasme, Pablo
    Zabala-Blanco, David
    Palacios Jativa, Pablo
    Azurdia-Meza, Cesar
    SENSORS, 2023, 23 (09)
  • [8] Combination of Nested Microphone Array and Subband Processing for Multiple Simultaneous Speaker Localization
    Firoozabadi, Ali Dehghan
    Abutalebi, Hamid Reza
    2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 907 - 912
  • [9] 3D Multiple Sound Source Localization by Proposed Cuboids Nested Microphone Array in Combination with Adaptive Wavelet-Based Subband GEVD
    Dehghan Firoozabadi, Ali
    Irarrazaval, Pablo
    Adasme, Pablo
    Zabala-Blanco, David
    Palacios-Jativa, Pablo
    Azurdia-Meza, Cesar
    ELECTRONICS, 2020, 9 (05)