A frequency domain method for blind source separation of convolutive audio mixtures

被引：82

作者：

Rahbar, K ^{[1
]}

Reilly, JP ^{[1
]}

机构：

[1] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4K1, Canada

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2005年 / 13卷 / 05期

基金：

加拿大自然科学与工程研究理事会;

关键词：

audio enhancement; frequency domain blind; source separation; joint diagonalization; permutation ambiguity;

D O I：

10.1109/TSA.2005.851925

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we propose a new frequency domain approach to blind source separation (BSS) of audio signals mixed in a reverberant environment. We propose a joint diagonalization procedure on the cross power spectral density matrices of the signals at the output of the mixing system to identify the mixing system at each frequency bin up to a scale and permutation ambiguity. The frequency domain joint diagonalization is performed using a new and quickly converging algorithm which uses an alternating least-squares (ALS) optimization method. The inverse of the mixing system is then used to separate the sources. An efficient dyadic algorithm to resolve the frequency dependent permutation ambiguities that exploits the inherent nonstationarity of the sources is presented. The effect of the unknown scaling ambiguities is partially resolved using an initialization procedure for the ALS algorithm. The performance of the proposed algorithm is demonstrated by experiments conducted in real reverberant rooms. Performance comparisons are made with previous methods.

引用

页码：832 / 844

页数：13

共 50 条

[21] A Gauss-Newton method for blind source separation of convolutive mixtures
Cruces, S
Castedo, L
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 2093 - 2096
[22] Perceptually motivated blind source separation of convolutive mixtures
Guddeti, RR
Mulgrew, B
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 273 - 276
[23] An online blind source separation for convolutive acoustic signals in frequency-domain
Wu Wenyan
Zhang Liming
ADVANCES IN NATURAL COMPUTATION, PT 1, 2006, 4221 : 451 - 460
[24] Blind Source Separation for Convolutive Mixtures with Neural Networks
Kirei, Botond Sandor
Topa, Marina Dana
Muresan, Irina
Homana, Ioana
Toma, Norbert
ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2011, 11 (01) : 63 - 68
[25] Semi-Blind Student's t Source Separation for Multichannel Audio Convolutive Mixtures
Leglaive, Simon
Badeau, Roland
Richard, Gael
2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2259 - 2263
[26] Fundamental limitation of frequency domain Blind Source Separation for convolutive mixture of speech
Shoko, A
Shoji, M
Nishikawa, T
Saruwatari, H
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 2737 - 2740
[27] Multi-inconsecutive-frames moving average for the frequency-domain blind source separation of convolutive mixtures
Chao, Wang
Yong, Fang
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 11 - 16
[28] Convolutive blind source separation for more than two sources in the frequency domain
Sawada, H
Mukai, R
Araki, S
Makino, S
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 885 - 888
[29] A Time–Frequency Domain Blind Source Separation Method for Underdetermined Instantaneous Mixtures
Tianliang Peng
Yang Chen
Zengli Liu
Circuits, Systems, and Signal Processing, 2015, 34 : 3883 - 3895
[30] Underdetermined blind separation of audio sources from the time-frequency representation of their convolutive mixtures
Aissa-El-Bey, Abdeldjalil
Abed-Meraim, Karim
Grenier, Yves
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 153 - 156

← 1 2 3 4 5 →