Semi-Blind Student's t Source Separation for Multichannel Audio Convolutive Mixtures

被引：0

作者：

Leglaive, Simon ^{[1
]}

Badeau, Roland ^{[1
]}

Richard, Gael ^{[1
]}

机构：

[1] Univ Paris Saclay, LTCI, Telecom ParisTech, F-75013 Paris, France

来源：

2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2017年

关键词：

Under-determined audio source separation; multichannel convolutive mixture; Student's t distribution; non-negative matrix factorization; variational inference; NONNEGATIVE MATRIX FACTORIZATION; INFORMATION; SPARSE;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper addresses the problem of multichannel audio source separation in under-determined convolutive mixtures. We target a semi-blind scenario assuming that the mixing filters are known. The convolutive mixing process is exactly modeled using the time-domain impulse responses of the mixing filters. We propose a Student's t time-frequency source model based on non-negative matrix factorization (NMF). The Student's t distribution being heavy-tailed with respect to the Gaussian, it provides some flexibility in the modeling of the sources. We also study a simpler Student's t sparse source model within the same general source separation framework. The inference procedure relies on a variational expectation-maximization algorithm. Experiments show the advantage of using an NMF model compared with the sparse source model. While the Student's t NMF source model leads to slightly better results than our previous Gaussian one, we demonstrate the superiority of our method over two other approaches from the literature.

引用

页码：2259 / 2263

页数：5

共 50 条

[1] Semi-blind source separation for convolutive mixtures based on frequency invariant transformation
Liu, W
Mandic, DP
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 285 - 288
[2] Semi-blind maximum likelihood separation of linear convolutive mixtures
Xavier, Joao
Barroso, Victor
2000, IEEE, Los Alamitos, CA, United States
[3] Semi-blind maximum likelihood separation of linear convolutive mixtures
Xavier, J
Barroso, V
PROCEEDINGS OF THE TENTH IEEE WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, 2000, : 329 - 333
[4] Multichannel blind deconvolution for source separation in convolutive mixtures of speech
Kokkinakis, K
Nandi, AK
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 200 - 212
[5] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
Ozerov, Alexey
Fevotte, Cedric
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563
[6] MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION IN CONVOLUTIVE MIXTURES. WITH APPLICATION TO BLIND AUDIO SOURCE SEPARATION.
Ozerov, Alexey
Fevotte, Cedric
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3137 - +
[7] Audio source separation of convolutive mixtures
Mitianoudis, N
Davies, ME
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 489 - 497
[8] A frequency domain method for blind source separation of convolutive audio mixtures
Rahbar, K
Reilly, JP
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 832 - 844
[9] Student's t Source and Mixing Models for Multichannel Audio Source Separation
Leglaive, Simon
Badeau, Roland
Richard, Gael
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (06) : 1150 - 1164
[10] Blind source separation of convolutive mixtures
Serviere, C
8TH IEEE SIGNAL PROCESSING WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, PROCEEDINGS, 1996, : 316 - 319

← 1 2 3 4 5 →