A frequency domain method for blind source separation of convolutive audio mixtures

Cited by: 82
Authors
Rahbar, K [1]
Reilly, JP [1]
Affiliation
[1] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4K1, Canada
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2005 / Vol. 13 / No. 5
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
audio enhancement; frequency domain blind source separation; joint diagonalization; permutation ambiguity
DOI
10.1109/TSA.2005.851925
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we propose a new frequency domain approach to blind source separation (BSS) of audio signals mixed in a reverberant environment. We propose a joint diagonalization procedure on the cross power spectral density matrices of the signals at the output of the mixing system to identify the mixing system at each frequency bin up to a scale and permutation ambiguity. The frequency domain joint diagonalization is performed using a new and quickly converging algorithm which uses an alternating least-squares (ALS) optimization method. The inverse of the mixing system is then used to separate the sources. An efficient dyadic algorithm to resolve the frequency dependent permutation ambiguities that exploits the inherent nonstationarity of the sources is presented. The effect of the unknown scaling ambiguities is partially resolved using an initialization procedure for the ALS algorithm. The performance of the proposed algorithm is demonstrated by experiments conducted in real reverberant rooms. Performance comparisons are made with previous methods.
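The "scale and permutation ambiguity" mentioned in the abstract can be illustrated at a single frequency bin. The following is a minimal sketch, not the authors' ALS algorithm: it assumes a square 2x2 complex mixing matrix `H`, diagonal source spectral matrices `D` that change from epoch to epoch (nonstationary sources), and cross power spectral density matrices of the form `P_k = H D_k H^H`. All names and numeric values are hypothetical. It shows that permuting and rescaling the columns of `H` yields another factorization that fits the same matrices with diagonal source terms, so joint diagonalization alone cannot tell the two apart.

```python
import numpy as np

rng = np.random.default_rng(0)
n_src, n_epochs = 2, 5

# Hypothetical mixing matrix at one frequency bin (complex, square for simplicity).
H = rng.standard_normal((n_src, n_src)) + 1j * rng.standard_normal((n_src, n_src))

# Diagonal source spectral matrices, one per time epoch. They differ across
# epochs because the sources are assumed to be nonstationary.
D = [np.diag(rng.uniform(0.5, 2.0, n_src)) for _ in range(n_epochs)]

# Cross power spectral density matrices at the mixer output: P_k = H D_k H^H.
P = [H @ Dk @ H.conj().T for Dk in D]

# Alternative factorization: swap the columns of H and rescale them.
perm = np.array([[0, 1], [1, 0]])
scale = np.diag([2.0 * np.exp(1j * 0.3), 0.5])
H_alt = H @ perm @ scale
S_inv = np.linalg.inv(perm @ scale)
D_alt = [S_inv @ Dk @ S_inv.conj().T for Dk in D]

# Both factorizations reproduce every P_k, and D_alt stays diagonal.
fits = all(np.allclose(H_alt @ Dk @ H_alt.conj().T, Pk) for Dk, Pk in zip(D_alt, P))
still_diagonal = all(np.allclose(Dk, np.diag(np.diag(Dk))) for Dk in D_alt)
print(fits, still_diagonal)
```

Because the swap and the scaling can differ from one frequency bin to the next, a separate step is needed to align bins before the separated signals are reconstructed, which is the role of the permutation and scaling procedures described in the abstract.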
Pages: 832-844
Number of pages: 13
Related papers
50 in total
  • [21] A Gauss-Newton method for blind source separation of convolutive mixtures
    Cruces, S
    Castedo, L
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 2093 - 2096
  • [22] Perceptually motivated blind source separation of convolutive mixtures
    Guddeti, RR
    Mulgrew, B
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 273 - 276
  • [23] An online blind source separation for convolutive acoustic signals in frequency-domain
    Wu Wenyan
    Zhang Liming
    ADVANCES IN NATURAL COMPUTATION, PT 1, 2006, 4221 : 451 - 460
  • [24] Blind Source Separation for Convolutive Mixtures with Neural Networks
    Kirei, Botond Sandor
    Topa, Marina Dana
    Muresan, Irina
    Homana, Ioana
    Toma, Norbert
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2011, 11 (01) : 63 - 68
  • [25] Semi-Blind Student's t Source Separation for Multichannel Audio Convolutive Mixtures
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2259 - 2263
  • [26] Fundamental limitation of frequency domain Blind Source Separation for convolutive mixture of speech
    Araki, S
    Makino, S
    Nishikawa, T
    Saruwatari, H
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 2737 - 2740
  • [27] Multi-inconsecutive-frames moving average for the frequency-domain blind source separation of convolutive mixtures
    Chao, Wang
    Yong, Fang
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 11 - 16
  • [28] Convolutive blind source separation for more than two sources in the frequency domain
    Sawada, H
    Mukai, R
    Araki, S
    Makino, S
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 885 - 888
  • [29] A Time–Frequency Domain Blind Source Separation Method for Underdetermined Instantaneous Mixtures
    Tianliang Peng
    Yang Chen
    Zengli Liu
    Circuits, Systems, and Signal Processing, 2015, 34 : 3883 - 3895
  • [30] Underdetermined blind separation of audio sources from the time-frequency representation of their convolutive mixtures
    Aissa-El-Bey, Abdeldjalil
    Abed-Meraim, Karim
    Grenier, Yves
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 153 - 156