LOCAL GAUSSIAN MODEL WITH SOURCE-SET CONSTRAINTS IN AUDIO SOURCE SEPARATION

被引:0
|
作者
Ikeshita, Rintaro [1 ]
Togami, Masahito [1 ]
Kawaguchi, Yohei [1 ]
Fujita, Yusuke [1 ]
Nagamatsu, Kenji [1 ]
机构
[1] Hitachi Ltd, Res & Dev Grp, Tokyo, Japan
关键词
Blind audio source separation; local Gaussian model; time-frequency mask; diffusion noise; permutation alignment; NONNEGATIVE MATRIX FACTORIZATION; MIXTURES;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To improve the performance of blind audio source separation of convolutive mixtures, the local Gaussian model (LGM) having full rank covariance matrices proposed by Duong et al. is extended. The previous model basically assumes that all sources contribute to each time-frequency slot, which may fail to capture the characteristic of signals with many intermittent silent periods. A constraint on source sets that contribute to each time-frequency slot is therefore explicitly introduced. This approach can be regarded as a relaxation of the sparsity constraint in the conventional time-frequency mask. The proposed model is jointly optimized among the original local Gaussian model parameters, the relaxed version of the time-frequency mask, and a permutation alignment, leading to a robust permutation-free algorithm. We also present a novel multi-channel Wiener filter weighted by a relaxed version of the time-frequency mask. Experimental results over noisy speech signals show that the proposed model is effective compared with the original local Gaussian model and is comparable to its extension, the multi-channel nonnegative matrix factorization.
引用
下载
收藏
页数:6
相关论文
共 50 条
  • [1] BAYESIAN ANISOTROPIC GAUSSIAN MODEL FOR AUDIO SOURCE SEPARATION
    Magron, Paul
    Virtanen, Tuomas
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 166 - 170
  • [2] Underdetermined Instantaneous Audio Source Separation via Local Gaussian Modeling
    Vincent, Emmanuel
    Arberet, Simon
    Gribonval, Remi
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 775 - 782
  • [3] PHASE-DEPENDENT ANISOTROPIC GAUSSIAN MODEL FOR AUDIO SOURCE SEPARATION
    Magron, Paul
    Badeau, Roland
    David, Bertrand
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 531 - 535
  • [4] Gaussian Modeling-Based Multichannel Audio Source Separation Exploiting Generic Source Spectral Model
    Thanh Thi Hien Duong
    Duong, Ngoc Q. K.
    Phuong Cong Nguyen
    Cuong Quoc Nguyen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 32 - 43
  • [5] Spatial location priors for Gaussian model based reverberant audio source separation
    Duong, Ngoc Q. K.
    Vincent, Emmanuel
    Gribonval, Remi
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
  • [6] Spatial location priors for Gaussian model based reverberant audio source separation
    Ngoc Q K Duong
    Emmanuel Vincent
    Rémi Gribonval
    EURASIP Journal on Advances in Signal Processing, 2013
  • [7] Blind Source Separation Based on Local Generalized Gaussian Mixture Model
    Chen, Yongqiang
    2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 935 - 938
  • [8] Audio source separation
    Davies, M
    MATHEMATICS IN SIGNAL PROCESSING V, 2002, (71): : 57 - 68
  • [9] Rigid Motion Model for Audio Source Separation
    Wolf, Guy
    Mallat, Stephane
    Shamma, Shihab
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2016, 64 (07) : 1822 - 1831
  • [10] Multichannel Audio Source Separation Exploiting NMF-Based Generic Source Spectral Model in Gaussian Modeling Framework
    Thanh Thi Hien Duong
    Duong, Ngoc Q. K.
    Cong-Phuong Nguyen
    Quoc-Cuong Nguyen
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 547 - 557