A Framework for Speech Enhancement With Ad Hoc Microphone Arrays

Cited by: 31
Authors
Tavakoli, Vincent Mohammad [1]
Jensen, Jesper Rindom [1]
Christensen, Mads Graesboll [1]
Benesty, Jacob [2, 3]
Affiliations
[1] Aalborg Univ, Dept Architecture Design & Media Technol, Audio Anal Lab, DK-9000 Aalborg, Denmark
[2] Univ Quebec, INRS EMT, Montreal, PQ H5A 1K6, Canada
[3] Aalborg Univ, Dept Architecture Design & Media Technol, DK-9000 Aalborg, Denmark
Keywords
Speech enhancement; microphone array; noise reduction; multichannel; pseudo-coherence vector; ad hoc array; NOISE-REDUCTION; SIGNAL ESTIMATION; NETWORKS; ALGORITHM
DOI
10.1109/TASLP.2016.2537202
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Speech enhancement is vital for improved listening practices. Ad hoc microphone arrays are promising assets for this purpose. Most well-established enhancement techniques for conventional arrays can be adapted to ad hoc scenarios. Despite recent efforts to introduce various ad hoc speech enhancement apparatus, a common framework for integrating conventional methods into this new scheme is still missing. This paper establishes such an abstraction based on inter- and intra-subarray speech coherencies. Along with measures of signal quality at the input of the subarrays, a measure of coherency is proposed both for subarray selection in local enhancement approaches and for selecting a proper global reference when more than one subarray is used. The methods proposed within this framework are evaluated with regard to quantitative and qualitative measures, including array gains, the speech distortion ratio, the PESQ measure, and the STOI intelligibility measure. The major findings of this work are the observed changes in the superiority of different methods under certain conditions. When the perceptual quality or intelligibility of the speech is the ultimate goal, there are turning points where the MVDR and the LCMV are superior to Wiener-based methods. Also, for certain scenarios, local approaches may be preferred to global ones.
Pages: 1038-1051
Page count: 14