Underdetermined Convolutive Blind Source Separation via Frequency Bin-Wise Clustering and Permutation Alignment

被引:271
|
作者
Sawada, Hiroshi [1 ]
Araki, Shoko [1 ]
Makino, Shoji [2 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Kyoto 6190237, Japan
[2] Univ Tsukuba, Tsukuba, Ibaraki 3058577, Japan
关键词
Blind source separation (BSS); convolutive mixture; expectation-maximization (EM) algorithm; permutation problem; short-time Fourier transform (STFT); sparseness; time-frequency (T-F) masking; MIXTURES; EM;
D O I
10.1109/TASL.2010.2051355
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a blind source separation method for convolutive mixtures of speech/audio sources. The method can even be applied to an underdetermined case where there are fewer microphones than sources. The separation operation is performed in the frequency domain and consists of two stages. In the first stage, frequency-domain mixture samples are clustered into each source by an expectation-maximization (EM) algorithm. Since the clustering is performed in a frequency bin-wise manner, the permutation ambiguities of the bin-wise clustered samples should be aligned. This is solved in the second stage by using the probability on how likely each sample belongs to the assigned class. This two-stage structure makes it possible to attain a good separation even under reverberant conditions. Experimental results for separating four speech signals with three microphones under reverberant conditions show the superiority of the new method over existing methods. We also report separation results for a benchmark data set and live recordings of speech mixtures.
引用
收藏
页码:516 / 527
页数:12
相关论文
共 50 条
  • [1] Bin-Wise Combination of Time-Frequency Masking and Beamforming for Convolutive Source Separation
    Bella, Mostafa
    Saylani, Hicham
    Hosseini, Shahram
    Deville, Yannick
    [J]. 2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [2] Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
    Reju, Vaninirappuputhenpurayil Gopalan
    Koh, Soo Ngee
    Soon, Ing Yann
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 101 - 116
  • [3] Measuring Dependence for Permutation Alignment in Convolutive Blind Source Separation
    Ma, Baoze
    Zhang, Tianqi
    An, Zeliang
    Yi, Chen
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1982 - 1986
  • [4] A New Method for Underdetermined Convolutive Blind Source Separation in Frequency Domain
    Chen, Yongqiang
    Liu, Jun
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1484 - 1487
  • [5] Measuring dependence of bin-wise separated signals for permutation alignment in frequency-domain BSS
    Sawada, Hiroshi
    Araki, Shoko
    Makino, Shoji
    [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 3247 - 3250
  • [6] A frequency bin-wise nonlinear masking algorithm in convolutive mixtures for speech segregation
    Chi, Tai-Shih
    Huang, Ching-Wen
    Chou, Wen-Sheng
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (05): : EL361 - EL367
  • [7] A NEW CLUSTERING APPROACH FOR SOLVING THE PERMUTATION PROBLEM IN CONVOLUTIVE BLIND SOURCE SEPARATION
    Mazur, Radoslaw
    Jungmann, Jan Ole
    Mertins, Alfred
    [J]. 2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2013,
  • [8] A probabilistic approach for blind source separation of underdetermined convolutive mixtures
    Peterson, JM
    Kadambe, S
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL VI, PROCEEDINGS: SIGNAL PROCESSING THEORY AND METHODS, 2003, : 581 - 584
  • [9] MAP-based Permutation Alignment for Underdeter ined Convolutive Blind Source Separation
    Cho, Janghoon
    Yoo, Chang D.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2016,
  • [10] A probabilistic approach for blind source separation of underdetermined convolutive mixtures
    Peterson, JM
    Kadambe, S
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 861 - 864