Spatial location priors for Gaussian model based reverberant audio source separation

被引:0
|
作者
Ngoc Q K Duong
Emmanuel Vincent
Rémi Gribonval
机构
[1] Technicolor Rennes Research & Innovation Center,
[2] Inria,undefined
[3] Inria,undefined
关键词
Audio source separation; Spatial covariance; EM algorithm; Probabilistic priors; Inverse-Wishart; Gaussian;
D O I
暂无
中图分类号
学科分类号
摘要
We consider the Gaussian framework for reverberant audio source separation, where the sources are modeled in the time-frequency domain by their short-term power spectra and their spatial covariance matrices. We propose two alternative probabilistic priors over the spatial covariance matrices which are consistent with the theory of statistical room acoustics and we derive expectation-maximization algorithms for maximum a posteriori (MAP) estimation. We argue that these algorithms provide a statistically principled solution to the permutation problem and to the risk of overfitting resulting from conventional maximum likelihood (ML) estimation. We show experimentally that in a semi-informed scenario where the source positions and certain room characteristics are known, the MAP algorithms outperform their ML counterparts. This opens the way to rigorous statistical treatment of this family of models in other scenarios in the future.
引用
收藏
相关论文
共 50 条
  • [31] Underdetermined Reverberant Audio-Source Separation Through Improved Expectation-Maximization Algorithm
    Xie, Yuan
    Xie, Kan
    Yang, Junjie
    Wu, Zongze
    Xie, Shengli
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (06) : 2877 - 2889
  • [32] REVERBERANT AUDIO SOURCE SEPARATION USING PARTIALLY PRE-TRAINED NONNEGATIVE MATRIX FACTORIZATION
    Fakhry, Mahmoud
    Svaizer, Piergiorgio
    Omologo, Maurizio
    2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 273 - 277
  • [33] Audio-Visual Underdetermined Blind Source Separation Algorithm Based on Gaussian Potential Function
    ZHANG Ye
    CAO Kang
    WU Kangrui
    YU Tenglong
    ZHOU Nanrun
    中国通信, 2014, 11 (06) : 71 - 80
  • [34] Rigid Motion Model for Audio Source Separation
    Wolf, Guy
    Mallat, Stephane
    Shamma, Shihab
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2016, 64 (07) : 1822 - 1831
  • [35] TIME-DOMAIN AUDIO SOURCE SEPARATION BASED ON GAUSSIAN PROCESSES WITH DEEP KERNEL LEARNING
    Nugraha, Aditya Arie
    Di Carlo, Diego
    Bando, Yoshiaki
    Fontaine, Mathieu
    Yoshii, Kazuyoshi
    2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA, 2023,
  • [36] Audio-Visual Underdetermined Blind Source Separation Algorithm Based on Gaussian Potential Function
    Zhang Ye
    Cao Kang
    Wu Kangrui
    Yu Tenglong
    Zhou Nanrun
    CHINA COMMUNICATIONS, 2014, 11 (06) : 71 - 80
  • [37] Underdetermined Instantaneous Audio Source Separation via Local Gaussian Modeling
    Vincent, Emmanuel
    Arberet, Simon
    Gribonval, Remi
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 775 - 782
  • [38] FastFCA: Joint Diagonalization Based Acceleration of Audio Source Separation Using a Full-Rank Spatial Covariance Model
    Ito, Nobutaka
    Araki, Shoko
    Nakatani, Tomohiro
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1667 - 1671
  • [39] Blind Source Separation Based on Local Generalized Gaussian Mixture Model
    Chen, Yongqiang
    2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 935 - 938
  • [40] A FAST EM ALGORITHM FOR GAUSSIAN MODEL-BASED SOURCE SEPARATION
    Thiemann, Joachim
    Vincent, Emmanuel
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,