STEREO SOURCE SEPARATION IN THE FREQUENCY DOMAIN: SOLVING THE PERMUTATION PROBLEM BY A SLIDING K-MEANS METHOD

被引:0
|
作者
Chen, Bang-Yin [1 ]
Liu, Tzu-Chi [1 ]
Liu, Yi-Wen [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu, Taiwan
关键词
blind source separation (BSS); permutation problem; independent component analysis (ICA); BLIND SOURCE SEPARATION; CONVOLUTIVE MIXTURES;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Blind source separation (BSS) has been widely utilized for recovering a set of source signals from their mixtures. When the mixture is convolutive, source separation can be solved in the frequency domain but involves several challenges including the scaling uncertainty and the permutation indeterminacy. This paper presents a sliding k-means algorithm to handle the permutation problem. Experiments were conducted by playing the source files to a pair of loudspeakers and obtaining the mixture by microphones. Objective indices are then defined to evaluate the separation performance based on the actual frequency responses. Results have shown that the standard k-means method alone can consistently achieve > 90.5% permutation accuracy in different parameter settings. After introducing the proposed sliding process, the permutation accuracy further rises. Compared to a previous de-permutation method [1], the present method has a more stable performance against parameter variations in terms of its permutation accuracy and signal-to-interference ratio (SIR).
引用
收藏
页码:4250 / 4254
页数:5
相关论文
共 50 条
  • [1] Improved Method for Solving Permutation Problem of Frequency Domain Blind Source Separation
    Wang Weihua
    Huang Fenggang
    [J]. 2008 6TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, VOLS 1-3, 2008, : 670 - 673
  • [2] A new method for solving the permutation problem of frequency-domain blind source separation
    Hu, XB
    Kobatake, H
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (06) : 1543 - 1548
  • [3] Partial separation method for solving permutation problem in frequency domain blind source separation of speech signals
    Reju, V. G.
    Koh, Soo Ngee
    Soon, Ing Yann
    [J]. NEUROCOMPUTING, 2008, 71 (10-12) : 2098 - 2112
  • [4] The Improved Method for Solving Permutation Problem in Frequency Domain Blind Source Separation of Speech Signals
    Zhang Dexiang
    Wu Xiaopei
    Lv Zhao
    Guo Xiaojing
    [J]. MATERIALS SCIENCE AND INFORMATION TECHNOLOGY, PTS 1-8, 2012, 433-440 : 7029 - 7034
  • [5] A robust and precise method for solving the permutation problem of frequency-domain blind source separation
    Sawada, H
    Mukai, R
    Araki, S
    Makino, S
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (05): : 530 - 538
  • [6] A robust correlation method for solving permutation problem in frequency domain blind source separation of speech signals
    Reju, V. G.
    Koh, Soo Ngee
    Soon, Ing Yann
    [J]. 2006 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, 2006, : 1891 - +
  • [7] A new method of solving permutation problem in blind source separation for convolutive acoustic signals in frequency-domain
    Wu, Wenyan
    Zhang, Liming
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1237 - 1242
  • [8] Generalized method for solving the permutation problem in frequency-domain blind source separation of convolved speech signals
    Sarmiento, Auxiliadora
    Duran, Ivan
    Cruces, Sergio
    Aguilera, Pablo
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 572 - 575
  • [9] A new approach to the permutation problem in frequency domain blind source separation
    Kamata, K
    Hu, XB
    Kobatake, H
    [J]. INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, 2004, 3195 : 849 - 856
  • [10] AN EXPECTATION-MAXIMIZATION METHOD FOR THE PERMUTATION PROBLEM IN FREQUENCY-DOMAIN BLIND SOURCE SEPARATION
    Ngo, Thom-Thi
    Nam, Seung-Hyon
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 17 - 20