Spatial location priors for Gaussian model based reverberant audio source separation

被引:0
|
作者
Ngoc Q K Duong
Emmanuel Vincent
Rémi Gribonval
机构
[1] Technicolor Rennes Research & Innovation Center,
[2] Inria,undefined
[3] Inria,undefined
关键词
Audio source separation; Spatial covariance; EM algorithm; Probabilistic priors; Inverse-Wishart; Gaussian;
D O I
暂无
中图分类号
学科分类号
摘要
We consider the Gaussian framework for reverberant audio source separation, where the sources are modeled in the time-frequency domain by their short-term power spectra and their spatial covariance matrices. We propose two alternative probabilistic priors over the spatial covariance matrices which are consistent with the theory of statistical room acoustics and we derive expectation-maximization algorithms for maximum a posteriori (MAP) estimation. We argue that these algorithms provide a statistically principled solution to the permutation problem and to the risk of overfitting resulting from conventional maximum likelihood (ML) estimation. We show experimentally that in a semi-informed scenario where the source positions and certain room characteristics are known, the MAP algorithms outperform their ML counterparts. This opens the way to rigorous statistical treatment of this family of models in other scenarios in the future.
引用
收藏
相关论文
共 50 条
  • [41] Audio Source Separation Based on Residual Reprojection
    Cho, Choongsang
    Kim, Je Woo
    Lee, Sangkeun
    ETRI JOURNAL, 2015, 37 (04) : 780 - 786
  • [42] Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation
    Kowalski, Matthieu
    Vincent, Emmanuel
    Gribonval, Remi
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1818 - 1829
  • [43] Gaussian mixture model for underdetermined source separation
    Zhang, YY
    Shi, XZ
    Lei, JY
    Xu, HX
    Huang, K
    Chen, CH
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 1965 - 1969
  • [44] AN ACOUSTICALLY-MOTIVATED SPATIAL PRIOR FOR UNDER-DETERMINED REVERBERANT SOURCE SEPARATION
    Duong, Ngoc Q. K.
    Vincent, Emmanuel
    Gribonval, Remi
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 9 - 12
  • [45] REVERBERANT SPEECH SEPARATION BASED ON AUDIO-VISUAL DICTIONARY LEARNING AND BINAURAL CUES
    Liu, Qingju
    Wang, Wenwu
    Jackson, Philip
    Barnard, Mark
    2012 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2012, : 664 - 667
  • [46] A Bayesian Hierarchical Model for Blind Audio Source Separation
    Laufer, Yaron
    Gannot, Sharon
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 276 - 280
  • [47] VARIATIONAL BAYESIAN MODEL AVERAGING FOR AUDIO SOURCE SEPARATION
    Jaureguiberry, Xabier
    Vincent, Emmanuel
    Richard, Gael
    2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 33 - 36
  • [48] Informed Audio Source Separation Using Linearly Constrained Spatial Filters
    Gorlow, Stanislaw
    Marchand, Sylvain
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (01): : 1 - 11
  • [49] DEEP BAYESIAN UNSUPERVISED SOURCE SEPARATION BASED ON A COMPLEX GAUSSIAN MIXTURE MODEL
    Bando, Yoshiaki
    Sasaki, Yoko
    Yoshii, Kazuyoshi
    2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
  • [50] Under-determined reverberant audio source separation using Bayesian Non-negative Matrix Factorization
    Mirzaei, Sayeh
    Van Hamme, Hugo
    Norouzi, Yaser
    SPEECH COMMUNICATION, 2016, 81 : 129 - 137