A frequency domain method for blind source separation of convolutive audio mixtures

Cited by: 82
Authors
Rahbar, K [1]
Reilly, JP [1]
Affiliation
[1] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4K1, Canada
Source
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
audio enhancement; frequency domain blind source separation; joint diagonalization; permutation ambiguity
DOI
10.1109/TSA.2005.851925
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we propose a new frequency domain approach to blind source separation (BSS) of audio signals mixed in a reverberant environment. We propose a joint diagonalization procedure on the cross power spectral density matrices of the signals at the output of the mixing system to identify the mixing system at each frequency bin up to a scale and permutation ambiguity. The frequency domain joint diagonalization is performed using a new and quickly converging algorithm which uses an alternating least-squares (ALS) optimization method. The inverse of the mixing system is then used to separate the sources. An efficient dyadic algorithm to resolve the frequency dependent permutation ambiguities that exploits the inherent nonstationarity of the sources is presented. The effect of the unknown scaling ambiguities is partially resolved using an initialization procedure for the ALS algorithm. The performance of the proposed algorithm is demonstrated by experiments conducted in real reverberant rooms. Performance comparisons are made with previous methods.
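The "scale and permutation ambiguity" mentioned in the abstract can be illustrated numerically. The sketch below is not the authors' algorithm; it only assumes the usual per-bin model in which each cross power spectral density matrix has the form P[m] = H diag(Lam[m]) H^H, with H the mixing matrix at one frequency bin and Lam[m] the source powers in epoch m. All names (H, Lam, cpsd, perm, scale) are hypothetical and the values are random. It shows that permuting and rescaling the columns of H, with a matching change to the source powers, reproduces exactly the same matrices, so joint diagonalization alone cannot tell the two factorizations apart.

```python
import numpy as np

rng = np.random.default_rng(0)
n_src, n_epochs = 2, 5

# Hypothetical mixing matrix at a single frequency bin (complex, square).
H = rng.standard_normal((n_src, n_src)) + 1j * rng.standard_normal((n_src, n_src))

# Source power spectra per epoch; they vary across epochs because the
# sources are assumed to be nonstationary.
Lam = rng.uniform(0.5, 2.0, size=(n_epochs, n_src))


def cpsd(H, Lam):
    """Return the stack of model matrices P[m] = H diag(Lam[m]) H^H."""
    return np.einsum("ik,mk,jk->mij", H, Lam, H.conj())


P = cpsd(H, Lam)

# An alternative factorization: swap the columns of H and rescale each one
# by an arbitrary nonzero complex factor, compensating in the source powers.
perm = np.array([1, 0])
scale = np.array([2.0 * np.exp(0.3j), 0.5 * np.exp(-1.1j)])
H_alt = H[:, perm] * scale
Lam_alt = Lam[:, perm] / np.abs(scale) ** 2

P_alt = cpsd(H_alt, Lam_alt)

# Largest entrywise difference between the two sets of model matrices.
max_diff = np.max(np.abs(P - P_alt))
print(f"max |P - P_alt| = {max_diff:.2e}")
```

Because the permutation can differ from one frequency bin to the next, a frequency-domain method needs a separate alignment step across bins, which is the role the abstract assigns to its dyadic permutation algorithm.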
Pages: 832-844
Page count: 13
Related papers
50 in total
  • [1] Blind source separation of convolutive mixtures of speech in frequency domain
    Makino, S
    Sawada, H
    Mukai, R
    Araki, S
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1640 - 1655
  • [2] Near-field frequency domain blind source separation for convolutive mixtures
    Mukai, R
    Sawada, H
    Araki, S
    Makino, S
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: AUDIO AND ELECTROACOUSTICS SIGNAL PROCESSING FOR COMMUNICATIONS, 2004, : 49 - 52
  • [3] The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech
    Araki, S
    Mukai, R
    Makino, S
    Nishikawa, T
    Saruwatari, H
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (02) : 109 - 116
  • [4] A New Method for Underdetermined Convolutive Blind Source Separation in Frequency Domain
    Chen, Yongqiang
    Liu, Jun
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1484 - 1487
  • [5] Audio source separation of convolutive mixtures
    Mitianoudis, N
    Davies, ME
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05) : 489 - 497
  • [6] A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures
    Sawada, Hiroshi
    Araki, Shoko
    Makino, Shoji
    [J]. 2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 157 - 160
  • [7] A fast and efficient frequency-domain method for convolutive Blind Source Separation
    Xie, Peng
    Grant, Steven L.
    [J]. 2008 IEEE REGION 5 CONFERENCE, 2008, : 138 - 141
  • [8] Blind source separation of convolutive mixtures
    Serviere, C
    [J]. 8TH IEEE SIGNAL PROCESSING WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, PROCEEDINGS, 1996, : 316 - 319
  • [9] BLIND SOURCE SEPARATION FOR CONVOLUTIVE MIXTURES
    THI, HLN
    JUTTEN, C
    [J]. SIGNAL PROCESSING, 1995, 45 (02) : 209 - 229
  • [10] Blind source separation of convolutive mixtures
    Makino, Shoji
    [J]. INDEPENDENT COMPONENT ANALYSES, WAVELETS, UNSUPERVISED SMART SENSORS, AND NEURAL NETWORKS IV, 2006, 6247