A low-complexity permutation alignment method for frequency-domain blind source separation

被引:9
|
作者
Kang, Fang [1 ,3 ]
Yang, Feiran [1 ,2 ,3 ]
Yang, Jun [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Acoust, State Key Lab Acoust, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
基金
国家重点研发计划;
关键词
Blind source separation (BSS); Permutation problem; Local permutation alignment; Global correction; Computational complexity; INDEPENDENT VECTOR ANALYSIS; SPEECH SEPARATION; ROBUST; NETWORKS; ICA;
D O I
10.1016/j.specom.2019.11.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Frequency-domain blind source separation is an effective way to separate the signals from convolutive mixtures. The independence component analysis (ICA) is commonly employed to separate signals in each frequency bin, resulting in the well-known permutation problem. To resolve this problem, we present a low-complexity permutation alignment method based on the inter-frequency dependence of signal power ratio. A bin-wise permutation alignment is first carried out across all the frequency bins by measuring the correlation between the current frequency bin and the previous one, but only the permutation with a high confidence is fixed. The permutation with low confidence is then determined by maximizing the correlation between the current frequency bin and a local centroid, which is calculated from a set of determined frequency bins with high confidence. By so doing, the permutation for most frequency bins is aligned without iterations. Finally, a clustering algorithm with centroids is adopted to achieve the fine global optimization in the fullband with only a few iterations. Experiment results show that the proposed method achieves a comparable performance with the state-of-the-art permutation alignment schemes, but the new method achieves a significant computational saving.
引用
收藏
页码:88 / 94
页数:7
相关论文
共 50 条
  • [1] INCREMENTAL METHOD OF PERMUTATION ALIGNMENT FOR FREQUENCY-DOMAIN BLIND SOURCE SEPARATION
    Emura, Satoru
    [J]. 2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [2] A new method for solving the permutation problem of frequency-domain blind source separation
    Hu, XB
    Kobatake, H
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (06) : 1543 - 1548
  • [3] A Novel Permutation Algorithm in Frequency-Domain Blind Source Separation
    Lv, Zhao
    Wu, Xiaopei
    Zhou, Bangyan
    Zhang, Chao
    [J]. 2015 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2015,
  • [4] A robust and precise method for solving the permutation problem of frequency-domain blind source separation
    Sawada, H
    Mukai, R
    Araki, S
    Makino, S
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (05): : 530 - 538
  • [5] AN EXPECTATION-MAXIMIZATION METHOD FOR THE PERMUTATION PROBLEM IN FREQUENCY-DOMAIN BLIND SOURCE SEPARATION
    Ngo, Thom-Thi
    Nam, Seung-Hyon
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 17 - 20
  • [6] A robust approach to the permutation problem of frequency-domain blind source separation
    Sawada, H
    Mukai, R
    Araki, S
    Makino, S
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 381 - 384
  • [7] A Region-Growing Permutation Alignment Approach in Frequency-Domain Blind Source Separation of Speech Mixtures
    Wang, Lin
    Ding, Heping
    Yin, Fuliang
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (03): : 549 - 557
  • [8] A beamforming approach to permutation alignment for multichannel frequency-domain blind speech separation
    Ikram, MZ
    Morgan, DR
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 881 - 884
  • [9] A new method of solving permutation problem in blind source separation for convolutive acoustic signals in frequency-domain
    Wu, Wenyan
    Zhang, Liming
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1237 - 1242
  • [10] Generalized method for solving the permutation problem in frequency-domain blind source separation of convolved speech signals
    Sarmiento, Auxiliadora
    Duran, Ivan
    Cruces, Sergio
    Aguilera, Pablo
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 572 - 575