A beamforming approach to permutation alignment for multichannel frequency-domain blind speech separation

Authors
Ikram, MZ [1]
Morgan, DR [1]
Affiliations
[1] Georgia Inst Technol, Ctr Signal & Image Proc, Atlanta, GA 30332 USA
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
In this work, we explore important connections between blind source separation (BSS) and ideal beamforming. We first compare the performance of a null-steering beamformer against that of a frequency-domain BSS method in a reverberant environment, drawing some interesting conclusions. We then examine the feasibility of using beamformer concepts to resolve permutation inconsistency across frequency, which degrades the performance of BSS methods in a reverberant environment. We also propose a permutation alignment scheme based on information gathered from the microphone array directivity patterns. This technique is novel in the sense that it works satisfactorily even when the directivity patterns exhibit grating lobes, where, in fact, better separation can be achieved in principle. We perform experiments that support the viability of the proposed method under different operating conditions and microphone spacings.
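The link between directivity patterns and permutation alignment can be illustrated with a minimal sketch. This is not the authors' scheme: it assumes a linear array under a far-field model, a microphone spacing small enough that each pattern has a single null (no grating lobes), and hypothetical helper names (`steering`, `directivity`, `align_by_nulls`). For one frequency bin, it evaluates the directivity pattern of each row of an unmixing matrix, finds the direction of its deepest null, and reorders the outputs so that the null directions are sorted consistently.

```python
import numpy as np

C = 343.0  # assumed speed of sound in m/s

def steering(freq, mic_pos, thetas):
    """Far-field steering vectors, shape (n_angles, n_mics)."""
    delays = np.outer(np.sin(thetas), mic_pos) / C
    return np.exp(-2j * np.pi * freq * delays)

def directivity(W, freq, mic_pos, thetas):
    """Magnitude response of each unmixing row, shape (n_outputs, n_angles)."""
    A = steering(freq, mic_pos, thetas)
    return np.abs(W @ A.T)

def align_by_nulls(W, freq, mic_pos, thetas):
    """Reorder the rows of W so that their null directions are ascending.

    Illustrative only: assumes one dominant null per row, which holds
    for two microphones spaced below half a wavelength.
    """
    U = directivity(W, freq, mic_pos, thetas)
    null_dirs = thetas[np.argmin(U, axis=1)]
    order = np.argsort(null_dirs)
    return W[order], null_dirs[order]
```

Applying `align_by_nulls` independently in every frequency bin would give each output channel a consistent null direction across frequency, under the stated assumptions. When grating lobes are present, each row has several nulls and this simple `argmin` rule is no longer sufficient, which is the case the abstract says the proposed method is designed to handle.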
Pages: 881-884
Number of pages: 4
Related papers
50 in total
  • [31] FREQUENCY-DOMAIN BLIND SPEECH SEPARATION USING INCOMPLETE DE-MIXING TRANSFORM
    Koldovsky, Zbynek
    Nesta, Francesco
    Tichavsky, Petr
    Ono, Nobutaka
    [J]. 2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1663 - 1667
  • [32] A class of frequency-domain adaptive approaches to blind multichannel identification
    Huang, YT
    Benesty, J
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2003, 51 (01) : 11 - 24
  • [33] The Improved Method for Solving Permutation Problem in Frequency Domain Blind Source Separation of Speech Signals
    Zhang Dexiang
    Wu Xiaopei
    Lv Zhao
    Guo Xiaojing
    [J]. MATERIALS SCIENCE AND INFORMATION TECHNOLOGY, PTS 1-8, 2012, 433-440 : 7029 - 7034
  • [34] Microphone array beamforming approach to blind speech separation
    Himawan, Ivan
    McCowan, Iain
    Lincoln, Mike
    [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2008, 4892 : 295 - +
  • [35] Solving Permutation Problem in Frequency-Domain Blind Source Separation Using Microphone Sub-arrays
    Li, Wanlong
    Liu, Ju
    Du, Jun
    Bai, Shuzhong
    [J]. 2008 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 67 - 72
  • [36] A Sparsity-Based Method to Solve Permutation Indeterminacy in Frequency-Domain Convolutive Blind Source Separation
    Sudhakar, Prasad
    Gribonval, Remi
    [J]. INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 338 - 345
  • [38] An MSE-Based Method to Avoid Permutation/Gain Indeterminacy in Frequency-Domain Blind Source Separation
    Dapena, Adriana
    Iglesia, Daniel
    Escudero, Carlos J.
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2010, 29 (03) : 403 - 417
  • [39] A new method of solving permutation problem in blind source separation for convolutive acoustic signals in frequency-domain
    Wu, Wenyan
    Zhang, Liming
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1237 - 1242
  • [40] TOWARD OVERCOMING FUNDAMENTAL LIMITATION IN FREQUENCY-DOMAIN BLIND SOURCE SEPARATION FOR REVERBERANT SPEECH MIXTURES
    Kim, Lae-Hoon
    Hasegawa-Johnson, Mark
    [J]. 2010 CONFERENCE RECORD OF THE FORTY FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2010, : 542 - 545