3D Localization of Multiple Simultaneous Speakers with Discrete Wavelet Transform and Proposed 3D Nested Microphone Array

被引:0
|
作者
Firoozabadi, Ali Dehghan [1 ]
Durney, Hugo [1 ]
Soto, Ismael [2 ]
Olave, Miguel Sanhueza [1 ]
机构
[1] Univ Tecnol Metropolitana, Dept Elect, Av Jose Pedro Alessandri 1242, Santiago 7800002, Chile
[2] Univ Santiago Chile, Elect Engn Dept, Santiago, Chile
关键词
Simultaneous sound source localization; Wavelet Transform; Generalized Cross-Correlation; Nested microphone array; Subband processing;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multiple sound source localization is one of the important topic in speech processing. GCC function is used as a traditional algorithm for sound source localization. This function estimates DOA for multiple speakers by calculation the cross-correlation between microphone signals but its accuracy decreases in adverse conditions. The aim of proposed method in this paper is localization of multiple simultaneous speakers in undesirable condition. The proposed method is based on novel 3D nested microphone array in combination with obtained information of Discrete Wavelet Transform (DWT) and subband processing. The proposed 3D nested microphone array prepares the condition for 3D localization and eliminates the spatial aliasing between microphone signals. Also, we propose the DWT for extraction the information of speech signal. Since, the spectral information of speech signal concentrates on low frequencies, we propose a structure of filter bank based on DWT to increase the frequency resolution on low frequencies. The performed evaluation on real and simulated data shows the superiority of our proposed method in comparison with Fullband and subband processing with uniform filters and uniform microphone array.
引用
收藏
页码:356 / 360
页数:5
相关论文
共 50 条
  • [1] 3D Multiple Sound Source Localization by Proposed Cuboids Nested Microphone Array in Combination with Adaptive Wavelet-Based Subband GEVD
    Dehghan Firoozabadi, Ali
    Irarrazaval, Pablo
    Adasme, Pablo
    Zabala-Blanco, David
    Palacios-Jativa, Pablo
    Azurdia-Meza, Cesar
    ELECTRONICS, 2020, 9 (05)
  • [2] Future activity prediction of multiple sclerosis with 3D MRI using 3D discrete wavelet transform
    Acar, Zueleyha Yilmaz
    Basciftci, Fatih
    Ekmekci, Ahmet Hakan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
  • [3] Coronary Arteries Segmentation Based on the 3D Discrete Wavelet Transform and 3D Neutrosophic Transform
    Chen, Shuo-Tsung
    Wang, Tzung-Dau
    Lee, Wen-Jeng
    Huang, Tsai-Wei
    Hung, Pei-Kai
    Wei, Cheng-Yu
    Chen, Chung-Ming
    Kung, Woon-Man
    BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [4] Diagnose Alzheimer's disease by combining 3D discrete wavelet transform and 3D moment invariants
    Lao, Huan
    Zhang, Xuejun
    IET IMAGE PROCESSING, 2022, 16 (14) : 3948 - 3964
  • [5] Compression of 3D Integral Images Using 3D Wavelet Transform
    Aggoun, Amar
    JOURNAL OF DISPLAY TECHNOLOGY, 2011, 7 (11): : 586 - 592
  • [6] 3D Discrete Wavelet Transform VLSI Architecture for Image Processing
    Tripathy, Malay Ranjan
    Sachdeva, Kapil
    Talhi, Rachid
    PIERS 2009 MOSCOW VOLS I AND II, PROCEEDINGS, 2009, : 1569 - +
  • [7] Hyperspectral Face Recognition using 3D Discrete Wavelet Transform
    Ghasemzadeh, Aman
    Demirel, Hasan
    2016 SIXTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2016,
  • [8] 3D Model Retrieval Based on 3D Discrete Cosine Transform
    Lmaati, Elmustapha Ait
    El Oirrak, Ahmed
    Kaddioui, Mohamaed Najib
    Ouahman, Abdellah Ait
    Sadgal, Mohammed
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2010, 7 (03) : 264 - 270
  • [9] Speech source 3D localization focusing algorithms based on microphone array
    Tai-liang, Ju
    Qi-cong, Peng
    Huai-zong, Shao
    2006 7TH INTERNATIONAL SYMPOSIUM ON ANTENNAS, PROPAGATION AND EM THEORY, VOLS 1 AND 2, PROCEEDINGS, 2006, : 108 - 111
  • [10] 3D Microphone Array Comparison: Objective Measurements
    Lee, Hyunkook
    Johnson, Dale
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2021, 69 (11): : 871 - 887