Student's t Source and Mixing Models for Multichannel Audio Source Separation

被引：8

作者：

Leglaive, Simon ^{[1
]}

Badeau, Roland ^{[1
]}

Richard, Gael ^{[1
]}

机构：

[1] Univ Paris Saclay, Telecom ParisTech, LTCI, F-75013 Paris, France

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2018年 / 26卷 / 06期

关键词：

Audio source separation; multichannel reverberant mixtures; Student's t distribution; statistical room acoustics; non-negative matrix factorization; variational inference; NONNEGATIVE MATRIX FACTORIZATION; FREQUENCY-RESPONSE CURVES; CONVOLUTIVE MIXTURES; BLIND SEPARATION; APPROXIMATION; INFORMATION; SPARSE;

D O I：

10.1109/TASLP.2018.2813011

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents a Bayesian framework for under-determined audio source separation in multichannel reverberant mixtures. We model the source signals as Student's t latent random variables in a time-frequency domain. The specific structure of musical signals in this domain is exploited by means of a nonnegative matrix factorization model. Conversely, we design the mixing model in the time domain. In addition to leading to an exact representation of the convolutive mixing process, this approach allows us to develop simple probabilistic priors for the mixing filters. Indeed, as those filters correspond to room responses they exhibit a simple characteristic structure in the time domain that can be used to guide their estimation. We also rely on the Student's t distribution for modeling the impulse response of the mixing filters. From this model, we develop a variational inference algorithm in order to perform source separation. The experimental evaluation demonstrates the potential of this approach for separating multichannel reverberant mixtures.

引用

页码：1150 / 1164

页数：15

共 50 条

[1] Semi-Blind Student's t Source Separation for Multichannel Audio Convolutive Mixtures
Leglaive, Simon
Badeau, Roland
Richard, Gael
[J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2259 - 2263
[2] Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models
Itakura, Kousuke
Bando, Yoshiaki
Nakamura, Eita
Itoyama, Katsutoshi
Yoshii, Kazuyoshi
Kawahara, Tatsuya
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 831 - 846
[3] STUDENT'S T MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION
Kitamura, Koichi
Bando, Yoshiaki
Itoyama, Katsutoshi
Yoshii, Kazuyoshi
[J]. 2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
[4] Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models
Hasumi, Takuya
Nakamura, Tomohiko
Takamune, Norihiro
Saruwatari, Hiroshi
Kitamura, Daichi
Takahashi, Yu
Kondo, Kazunobu
[J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1226 - 1233
[5] Blind Source Separation for Convolutive Audio Mixing
Rosebell, V. Jerine Rini
Sugumar, D.
Shindu
Sherin
[J]. INFORMATION TECHNOLOGY AND MOBILE COMMUNICATION, 2011, 147 : 473 - 476
[6] Multichannel Audio Source Separation With Probabilistic Reverberation Priors
Leglaive, Simon
Badeau, Roland
Richard, Gael
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2453 - 2465
[7] MULTICHANNEL AUDIO SOURCE SEPARATION WITH PROBABILISTIC REVERBERATION MODELING
Leglaive, Simon
Badeau, Roland
Richard, Gael
[J]. 2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
[8] Multichannel Audio Source Separation With Deep Neural Networks
Nugraha, Aditya Arie
Liutkus, Antoine
Vincent, Emmanuel
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1652 - 1664
[9] ALPHA-STABLE MULTICHANNEL AUDIO SOURCE SEPARATION
Leglaive, Simon
Simsekli, Umut
Liutkus, Antoine
Badeau, Roland
Richard, Gael
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 576 - 580
[10] On the Use of Latent Mixing Filters in Audio Source Separation
Girin, Laurent
Badeau, Roland
[J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017), 2017, 10169 : 225 - 235

← 1 2 3 4 5 →