Semi-Blind Student's t Source Separation for Multichannel Audio Convolutive Mixtures

被引:0
|
作者
Leglaive, Simon [1 ]
Badeau, Roland [1 ]
Richard, Gael [1 ]
机构
[1] Univ Paris Saclay, LTCI, Telecom ParisTech, F-75013 Paris, France
关键词
Under-determined audio source separation; multichannel convolutive mixture; Student's t distribution; non-negative matrix factorization; variational inference; NONNEGATIVE MATRIX FACTORIZATION; INFORMATION; SPARSE;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the problem of multichannel audio source separation in under-determined convolutive mixtures. We target a semi-blind scenario assuming that the mixing filters are known. The convolutive mixing process is exactly modeled using the time-domain impulse responses of the mixing filters. We propose a Student's t time-frequency source model based on non-negative matrix factorization (NMF). The Student's t distribution being heavy-tailed with respect to the Gaussian, it provides some flexibility in the modeling of the sources. We also study a simpler Student's t sparse source model within the same general source separation framework. The inference procedure relies on a variational expectation-maximization algorithm. Experiments show the advantage of using an NMF model compared with the sparse source model. While the Student's t NMF source model leads to slightly better results than our previous Gaussian one, we demonstrate the superiority of our method over two other approaches from the literature.
引用
收藏
页码:2259 / 2263
页数:5
相关论文
共 50 条
  • [1] Semi-blind source separation for convolutive mixtures based on frequency invariant transformation
    Liu, W
    Mandic, DP
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 285 - 288
  • [2] Semi-blind maximum likelihood separation of linear convolutive mixtures
    Xavier, J
    Barroso, V
    [J]. PROCEEDINGS OF THE TENTH IEEE WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, 2000, : 329 - 333
  • [3] Multichannel blind deconvolution for source separation in convolutive mixtures of speech
    Kokkinakis, K
    Nandi, AK
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 200 - 212
  • [4] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
    Ozerov, Alexey
    Fevotte, Cedric
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563
  • [5] MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION IN CONVOLUTIVE MIXTURES. WITH APPLICATION TO BLIND AUDIO SOURCE SEPARATION.
    Ozerov, Alexey
    Fevotte, Cedric
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3137 - +
  • [6] Audio source separation of convolutive mixtures
    Mitianoudis, N
    Davies, ME
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 489 - 497
  • [7] A frequency domain method for blind source separation of convolutive audio mixtures
    Rahbar, K
    Reilly, JP
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 832 - 844
  • [8] Student's t Source and Mixing Models for Multichannel Audio Source Separation
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (06) : 1150 - 1164
  • [9] Blind source separation of convolutive mixtures
    Serviere, C
    [J]. 8TH IEEE SIGNAL PROCESSING WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, PROCEEDINGS, 1996, : 316 - 319
  • [10] BLIND SOURCE SEPARATION FOR CONVOLUTIVE MIXTURES
    THI, HLN
    JUTTEN, C
    [J]. SIGNAL PROCESSING, 1995, 45 (02) : 209 - 229