Generalized method for solving the permutation problem in frequency-domain blind source separation of convolved speech signals

被引:0
|
作者
Sarmiento, Auxiliadora [1 ]
Duran, Ivan [1 ]
Cruces, Sergio [1 ]
Aguilera, Pablo [1 ]
机构
[1] Univ Seville, Dept Signal Theory & Commun, Seville, Spain
关键词
convolutive mixtures; permutation correction; frequency domain separation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The blind speech separation of convolutive mixtures can be performed in the time-frequency domain. The separation problem becomes to a set of instantaneous mixing problems, one for each frequency bin, that can be solved independently by any appropiated instantaneous ICA algorithm. However, the arbitrary order of the estimated sources in each frequency, known as permutation problem, has to be solved to succesfully recover the original sources. This paper deals with the permutation problem in the general case of N sources and N observations. The proposed method combines a correlation approach based on the amplitude correlation property of speech signals, and an optimal pairing scheme to align the permuted solutions. Our method is robust to artificially permuted speech signals. Experimental results on simulated convolutive mixtures show the effectiveness of the proposed method in terms of quality of separated signals by objective and perceptually measures.
引用
收藏
页码:572 / 575
页数:4
相关论文
共 50 条
  • [1] The Improved Method for Solving Permutation Problem in Frequency Domain Blind Source Separation of Speech Signals
    Zhang Dexiang
    Wu Xiaopei
    Lv Zhao
    Guo Xiaojing
    [J]. MATERIALS SCIENCE AND INFORMATION TECHNOLOGY, PTS 1-8, 2012, 433-440 : 7029 - 7034
  • [2] Partial separation method for solving permutation problem in frequency domain blind source separation of speech signals
    Reju, V. G.
    Koh, Soo Ngee
    Soon, Ing Yann
    [J]. NEUROCOMPUTING, 2008, 71 (10-12) : 2098 - 2112
  • [3] A new method for solving the permutation problem of frequency-domain blind source separation
    Hu, XB
    Kobatake, H
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (06) : 1543 - 1548
  • [4] A new method of solving permutation problem in blind source separation for convolutive acoustic signals in frequency-domain
    Wu, Wenyan
    Zhang, Liming
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1237 - 1242
  • [5] A robust correlation method for solving permutation problem in frequency domain blind source separation of speech signals
    Reju, V. G.
    Koh, Soo Ngee
    Soon, Ing Yann
    [J]. 2006 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, 2006, : 1891 - +
  • [6] A robust and precise method for solving the permutation problem of frequency-domain blind source separation
    Sawada, H
    Mukai, R
    Araki, S
    Makino, S
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (05): : 530 - 538
  • [7] An Approach to Solving a Permutation Problem of Frequency Domain Independent Component Analysis for Blind Source Separation of Speech Signals
    Fujieda, Masaru
    Murakami, Takahiro
    Ishida, Yoshihisa
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 18, 2006, 18 : 64 - 68
  • [8] Improved Method for Solving Permutation Problem of Frequency Domain Blind Source Separation
    Wang Weihua
    Huang Fenggang
    [J]. 2008 6TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, VOLS 1-3, 2008, : 670 - 673
  • [9] AN EXPECTATION-MAXIMIZATION METHOD FOR THE PERMUTATION PROBLEM IN FREQUENCY-DOMAIN BLIND SOURCE SEPARATION
    Ngo, Thom-Thi
    Nam, Seung-Hyon
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 17 - 20
  • [10] A robust approach to the permutation problem of frequency-domain blind source separation
    Sawada, H
    Mukai, R
    Araki, S
    Makino, S
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 381 - 384