Spatial location priors for Gaussian model based reverberant audio source separation

被引:0
|
作者
Ngoc Q K Duong
Emmanuel Vincent
Rémi Gribonval
机构
[1] Technicolor Rennes Research & Innovation Center,
[2] Inria,undefined
[3] Inria,undefined
关键词
Audio source separation; Spatial covariance; EM algorithm; Probabilistic priors; Inverse-Wishart; Gaussian;
D O I
暂无
中图分类号
学科分类号
摘要
We consider the Gaussian framework for reverberant audio source separation, where the sources are modeled in the time-frequency domain by their short-term power spectra and their spatial covariance matrices. We propose two alternative probabilistic priors over the spatial covariance matrices which are consistent with the theory of statistical room acoustics and we derive expectation-maximization algorithms for maximum a posteriori (MAP) estimation. We argue that these algorithms provide a statistically principled solution to the permutation problem and to the risk of overfitting resulting from conventional maximum likelihood (ML) estimation. We show experimentally that in a semi-informed scenario where the source positions and certain room characteristics are known, the MAP algorithms outperform their ML counterparts. This opens the way to rigorous statistical treatment of this family of models in other scenarios in the future.
引用
收藏
相关论文
共 50 条
  • [1] Spatial location priors for Gaussian model based reverberant audio source separation
    Duong, Ngoc Q. K.
    Vincent, Emmanuel
    Gribonval, Remi
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
  • [2] Reverberant Source Separation Using NTF With Delayed Subsources and Spatial Priors
    Fras, Mieszko
    Kowalczyk, Konrad
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1954 - 1967
  • [3] AUDIO SOURCE SEPARATION WITH MAGNITUDE PRIORS: THE BEADS MODEL
    Liutkus, Antoine
    Rohlfing, Christian
    Deleforge, Antoine
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 56 - 60
  • [4] ESTIMATION OF THE SPATIAL INFORMATION IN GAUSSIAN MODEL BASED AUDIO SOURCE SEPARATION USING WEIGHTED SPECTRAL BASES
    Fakhry, Mahmoud
    Svaizer, Piergiorgio
    Omologo, Maurizio
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1188 - 1192
  • [5] BAYESIAN ANISOTROPIC GAUSSIAN MODEL FOR AUDIO SOURCE SEPARATION
    Magron, Paul
    Virtanen, Tuomas
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 166 - 170
  • [6] Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model
    Duong, Ngoc Q. K.
    Vincent, Emmanuel
    Gribonval, Remi
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1830 - 1840
  • [7] SPARSE GAUSSIAN PROCESS AUDIO SOURCE SEPARATION USING SPECTRUM PRIORS IN THE TIME-DOMAIN
    Alvarado, Pablo A.
    Alvarez, Mauricio A.
    Stowell, Dan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 995 - 999
  • [8] Sparse Reverberant Audio Source Separation via Reweighted Analysis
    Arberet, Simon
    Vandergheynst, Pierre
    Carrillo, Rafael E.
    Thiran, Jean-Philippe
    Wiaux, Yves
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07): : 1391 - 1402
  • [9] Multichannel Audio Source Separation With Probabilistic Reverberation Priors
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2453 - 2465
  • [10] Unsupervised Audio Source Separation using Generative Priors
    Narayanaswamy, Vivek
    Thiagarajan, Jayaraman J.
    Anirudh, Rushil
    Spanias, Andreas
    INTERSPEECH 2020, 2020, : 2657 - 2661