Spatial location priors for Gaussian model based reverberant audio source separation

被引:0
|
作者
Ngoc Q K Duong
Emmanuel Vincent
Rémi Gribonval
机构
[1] Technicolor Rennes Research & Innovation Center,
[2] Inria,undefined
[3] Inria,undefined
关键词
Audio source separation; Spatial covariance; EM algorithm; Probabilistic priors; Inverse-Wishart; Gaussian;
D O I
暂无
中图分类号
学科分类号
摘要
We consider the Gaussian framework for reverberant audio source separation, where the sources are modeled in the time-frequency domain by their short-term power spectra and their spatial covariance matrices. We propose two alternative probabilistic priors over the spatial covariance matrices which are consistent with the theory of statistical room acoustics and we derive expectation-maximization algorithms for maximum a posteriori (MAP) estimation. We argue that these algorithms provide a statistically principled solution to the permutation problem and to the risk of overfitting resulting from conventional maximum likelihood (ML) estimation. We show experimentally that in a semi-informed scenario where the source positions and certain room characteristics are known, the MAP algorithms outperform their ML counterparts. This opens the way to rigorous statistical treatment of this family of models in other scenarios in the future.
引用
收藏
相关论文
共 50 条
  • [21] Reverberant Audio Source Separation via Sparse and Low-Rank Modeling
    Arberet, Simon
    Vandergheynst, Pierre
    IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (04) : 404 - 408
  • [22] A SOURCE SEPARATION EVALUATION METHOD IN OBJECT-BASED SPATIAL AUDIO
    Liu, Qingju
    Wang, Wenwu
    Jackson, Philip J. B.
    Cox, Trevor J.
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1088 - 1092
  • [23] Blind source separation based on generalized gaussian model
    杨斌
    孔薇
    周越
    Journal of Harbin Institute of Technology, 2007, (03) : 362 - 367
  • [24] Blind source separation based on generalized Gaussian model
    Information Engineering College, Shanghai Maritime University, Shanghai 200135, China
    不详
    J. Harbin Inst. Technol., 2007, 3 (362-367):
  • [25] Angle-based virtual source location representation for spatial audio coding
    Beack, S
    Seo, J
    Moon, H
    Kang, K
    Hahn, M
    ETRI JOURNAL, 2006, 28 (02) : 219 - 222
  • [26] Reverberant Audio Blind Source Separation via Local Convolutive Independent Vector Analysis
    Feng, Fangchen
    Begdadi, Azeddine
    2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
  • [27] Underdetermined Reverberant Audio-Source Separation Through Improved Expectation–Maximization Algorithm
    Yuan Xie
    Kan Xie
    Junjie Yang
    Zongze Wu
    Shengli Xie
    Circuits, Systems, and Signal Processing, 2019, 38 : 2877 - 2889
  • [28] Video-Aided Model-Based Source Separation in Real Reverberant Rooms
    Khan, Muhammad Salman
    Naqvi, Syed Mohsen
    Ata-ur-Rehman
    Wang, Wenwu
    Chambers, Jonathon
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1900 - 1912
  • [29] USING SPATIAL GAUSSIAN PRIORS TO MODEL HETEROGENEITY IN ENVIRONMENTAL EPIDEMIOLOGY
    LAWSON, AB
    STATISTICIAN, 1994, 43 (01): : 69 - 76
  • [30] Model-Based STFT Phase Recovery for Audio Source Separation
    Magron, Paul
    Badeau, Roland
    David, Bertrand
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (06) : 1091 - 1101