LOCAL GAUSSIAN MODEL WITH SOURCE-SET CONSTRAINTS IN AUDIO SOURCE SEPARATION

被引:0
|
作者
Ikeshita, Rintaro [1 ]
Togami, Masahito [1 ]
Kawaguchi, Yohei [1 ]
Fujita, Yusuke [1 ]
Nagamatsu, Kenji [1 ]
机构
[1] Hitachi Ltd, Res & Dev Grp, Tokyo, Japan
关键词
Blind audio source separation; local Gaussian model; time-frequency mask; diffusion noise; permutation alignment; NONNEGATIVE MATRIX FACTORIZATION; MIXTURES;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To improve the performance of blind audio source separation of convolutive mixtures, the local Gaussian model (LGM) having full rank covariance matrices proposed by Duong et al. is extended. The previous model basically assumes that all sources contribute to each time-frequency slot, which may fail to capture the characteristic of signals with many intermittent silent periods. A constraint on source sets that contribute to each time-frequency slot is therefore explicitly introduced. This approach can be regarded as a relaxation of the sparsity constraint in the conventional time-frequency mask. The proposed model is jointly optimized among the original local Gaussian model parameters, the relaxed version of the time-frequency mask, and a permutation alignment, leading to a robust permutation-free algorithm. We also present a novel multi-channel Wiener filter weighted by a relaxed version of the time-frequency mask. Experimental results over noisy speech signals show that the proposed model is effective compared with the original local Gaussian model and is comparable to its extension, the multi-channel nonnegative matrix factorization.
引用
下载
收藏
页数:6
相关论文
共 50 条
  • [31] ON-THE-FLY AUDIO SOURCE SEPARATION
    El Badawy, Dalia
    Duong, Ngoc Q. K.
    Ozerov, Alexey
    2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [32] Audio source separation: solutions and problems
    Mitianoudis, N
    Davies, ME
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2004, 18 (03) : 299 - 314
  • [33] DOPING AUDIO SIGNALS FOR SOURCE SEPARATION
    Mahe, Gael
    Nadalin, Everton Z.
    Romano, Joao-Marcos T.
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2402 - 2406
  • [34] Audio source separation with a single sensor
    Benaroya, L
    Bimbot, F
    Gribonval, R
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 191 - 199
  • [35] ADVERSARIAL ATTACKS ON AUDIO SOURCE SEPARATION
    Takahashi, Naoya
    Inoue, Shota
    Mitsufuji, Yuki
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 521 - 525
  • [36] WEAKLY INFORMED AUDIO SOURCE SEPARATION
    Schulze-Forster, Kilian
    Doire, Clement
    Richard, Gael
    Badeau, Roland
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 273 - 277
  • [37] Measure of Quality of Source Separation for Sub- and Super-Gaussian Audio Mixtures
    Naik, Ganesh R.
    INFORMATICA, 2012, 23 (04) : 581 - 599
  • [38] An online algorithm for blind source separation with Gaussian mixture model
    Ohata, M
    Tokunari, T
    Matsuoka, K
    IEEE 2000 ADAPTIVE SYSTEMS FOR SIGNAL PROCESSING, COMMUNICATIONS, AND CONTROL SYMPOSIUM - PROCEEDINGS, 2000, : 375 - 378
  • [39] Source adaptive blind source separation: Gaussian models and sparsity
    Pham, DT
    Cardoso, JF
    WAVELETS: APPLICATIONS IN SIGNAL AND IMAGE PROCESSING X, PTS 1 AND 2, 2003, 5207 : 340 - 351
  • [40] Model-Based STFT Phase Recovery for Audio Source Separation
    Magron, Paul
    Badeau, Roland
    David, Bertrand
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (06) : 1091 - 1101