Student's t Source and Mixing Models for Multichannel Audio Source Separation

Cited by: 8
Authors
Leglaive, Simon [1]
Badeau, Roland [1]
Richard, Gael [1]
Affiliations
[1] Univ Paris Saclay, Telecom ParisTech, LTCI, F-75013 Paris, France
Keywords
Audio source separation; multichannel reverberant mixtures; Student's t distribution; statistical room acoustics; non-negative matrix factorization; variational inference; NONNEGATIVE MATRIX FACTORIZATION; FREQUENCY-RESPONSE CURVES; CONVOLUTIVE MIXTURES; BLIND SEPARATION; APPROXIMATION; INFORMATION; SPARSE;
DOI
10.1109/TASLP.2018.2813011
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper presents a Bayesian framework for under-determined audio source separation in multichannel reverberant mixtures. We model the source signals as Student's t latent random variables in a time-frequency domain. The specific structure of musical signals in this domain is exploited by means of a nonnegative matrix factorization model. Conversely, we design the mixing model in the time domain. In addition to leading to an exact representation of the convolutive mixing process, this approach allows us to develop simple probabilistic priors for the mixing filters. Indeed, as those filters correspond to room responses they exhibit a simple characteristic structure in the time domain that can be used to guide their estimation. We also rely on the Student's t distribution for modeling the impulse response of the mixing filters. From this model, we develop a variational inference algorithm in order to perform source separation. The experimental evaluation demonstrates the potential of this approach for separating multichannel reverberant mixtures.
Pages: 1150-1164
Number of pages: 15
Related papers
50 in total
  • [1] Semi-Blind Student's t Source Separation for Multichannel Audio Convolutive Mixtures
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2259 - 2263
  • [2] Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 831 - 846
  • [3] STUDENT'S T MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION
    Kitamura, Koichi
    Bando, Yoshiaki
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    [J]. 2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [4] Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models
    Hasumi, Takuya
    Nakamura, Tomohiko
    Takamune, Norihiro
    Saruwatari, Hiroshi
    Kitamura, Daichi
    Takahashi, Yu
    Kondo, Kazunobu
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1226 - 1233
  • [5] Blind Source Separation for Convolutive Audio Mixing
    Rosebell, V. Jerine Rini
    Sugumar, D.
    Shindu
    Sherin
    [J]. INFORMATION TECHNOLOGY AND MOBILE COMMUNICATION, 2011, 147 : 473 - 476
  • [6] Multichannel Audio Source Separation With Probabilistic Reverberation Priors
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2453 - 2465
  • [7] MULTICHANNEL AUDIO SOURCE SEPARATION WITH PROBABILISTIC REVERBERATION MODELING
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    [J]. 2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [8] Multichannel Audio Source Separation With Deep Neural Networks
    Nugraha, Aditya Arie
    Liutkus, Antoine
    Vincent, Emmanuel
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1652 - 1664
  • [9] ALPHA-STABLE MULTICHANNEL AUDIO SOURCE SEPARATION
    Leglaive, Simon
    Simsekli, Umut
    Liutkus, Antoine
    Badeau, Roland
    Richard, Gael
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 576 - 580
  • [10] On the Use of Latent Mixing Filters in Audio Source Separation
    Girin, Laurent
    Badeau, Roland
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017), 2017, 10169 : 225 - 235