LOCAL GAUSSIAN MODEL WITH SOURCE-SET CONSTRAINTS IN AUDIO SOURCE SEPARATION

Cited by: 0
Authors
Ikeshita, Rintaro [1]
Togami, Masahito [1]
Kawaguchi, Yohei [1]
Fujita, Yusuke [1]
Nagamatsu, Kenji [1]
Affiliations
[1] Hitachi Ltd, Res & Dev Grp, Tokyo, Japan
Keywords
Blind audio source separation; local Gaussian model; time-frequency mask; diffusion noise; permutation alignment; nonnegative matrix factorization; mixtures
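DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Subject classification codes
0808; 0809
Abstract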
To improve the performance of blind audio source separation of convolutive mixtures, the local Gaussian model (LGM) with full-rank covariance matrices proposed by Duong et al. is extended. The previous model basically assumes that all sources contribute to each time-frequency slot, which may fail to capture the characteristics of signals with many intermittent silent periods. A constraint on the source sets that contribute to each time-frequency slot is therefore explicitly introduced. This approach can be regarded as a relaxation of the sparsity constraint in the conventional time-frequency mask. The proposed model jointly optimizes the original local Gaussian model parameters, the relaxed version of the time-frequency mask, and a permutation alignment, leading to a robust permutation-free algorithm. We also present a novel multi-channel Wiener filter weighted by a relaxed version of the time-frequency mask. Experimental results on noisy speech signals show that the proposed model is effective compared with the original local Gaussian model and is comparable to its extension, multi-channel nonnegative matrix factorization.
Pages: 6