MULTICHANNEL AUDIO SOURCE SEPARATION WITH PROBABILISTIC REVERBERATION MODELING

被引:0
|
作者
Leglaive, Simon [1 ]
Badeau, Roland [1 ]
Richard, Gael [1 ]
机构
[1] Telecom ParisTech, Inst Mines Telecom, CNRS LTCI, Paris, France
关键词
Blind audio source separation; Under-determined convolutive mixtures; Probabilistic prior; MAP estimation; EM algorithm; NONNEGATIVE MATRIX FACTORIZATION; BLIND; INFORMATION; MIXTURES;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we show that considering early contributions of mixing filters through a probabilistic prior can help blind source separation in reverberant recording conditions. By modeling mixing filters as the direct path plus R-1 reflections, we represent the propagation from a source to a mixture channel as an autoregressive process of order R in the frequency domain. This model is used as a prior to derive a Maximum A Posteriori (MAP) estimation of the mixing filters using the Expectation-Maximization (EM) algorithm. Experimental results over reverberant synthetic mixtures and live recordings show that MAP estimation with this prior provides better separation results than a Maximum Likelihood (ML) estimation.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Multichannel Audio Source Separation With Probabilistic Reverberation Priors
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2453 - 2465
  • [2] Gaussian Modeling-Based Multichannel Audio Source Separation Exploiting Generic Source Spectral Model
    Thanh Thi Hien Duong
    Duong, Ngoc Q. K.
    Phuong Cong Nguyen
    Cuong Quoc Nguyen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 32 - 43
  • [3] Multichannel Audio Source Separation With Deep Neural Networks
    Nugraha, Aditya Arie
    Liutkus, Antoine
    Vincent, Emmanuel
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1652 - 1664
  • [4] ALPHA-STABLE MULTICHANNEL AUDIO SOURCE SEPARATION
    Leglaive, Simon
    Simsekli, Umut
    Liutkus, Antoine
    Badeau, Roland
    Richard, Gael
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 576 - 580
  • [5] SEPNET: A DEEP SEPARATION MATRIX PREDICTION NETWORK FOR MULTICHANNEL AUDIO SOURCE SEPARATION
    Inoue, Shota
    Kameoka, Hirokazu
    Li, Li
    Makino, Shoji
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 191 - 195
  • [6] Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 831 - 846
  • [7] Student's t Source and Mixing Models for Multichannel Audio Source Separation
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (06) : 1150 - 1164
  • [8] Multichannel Audio Source Separation Exploiting NMF-Based Generic Source Spectral Model in Gaussian Modeling Framework
    Thanh Thi Hien Duong
    Duong, Ngoc Q. K.
    Cong-Phuong Nguyen
    Quoc-Cuong Nguyen
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 547 - 557
  • [9] BAYESIAN MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR AUDIO SOURCE SEPARATION AND LOCALIZATION
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 551 - 555
  • [10] Independent Deeply Learned Matrix Analysis for Multichannel Audio Source Separation
    Mogami, Shinichi
    Sumino, Hayato
    Kitamura, Daichi
    Takamune, Norihiro
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    Ono, Nobutaka
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1557 - 1561