Mixtures of Gamma Priors for Non-negative Matrix Factorization Based Speech Separation

被引:0
|
作者
Virtanen, Tuomas [1 ]
Cemgil, Ali Taylan [2 ]
机构
[1] Tampere Univ Technol, Korkeakoulunkatu 1, FI-33720 Tampere, Finland
[2] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper deals with audio source separation using supervised non-negative matrix factorization (NMF). We propose a prior model based on mixtures of Gamma distributions for each sound class, which hyperparameters are trained given a training corpus. This formulation allows adapting the spectral basis vectors of the sound sources during actual operation, when the exact characteristics of the sources are not known in advance. Simulations were conducted using a random mixture of two speakers. Even without adaptation the mixture model outperformed the basic NMF, and adaptation furher improved slightly the separation quality. Audio demonstrations are available at www.cs.tut.fi/(similar to)tuomasv.
引用
收藏
页码:646 / +
页数:2
相关论文
共 50 条
  • [1] Robust Non-negative Matrix Factorization with β-Divergence for Speech Separation
    Li, Yinan
    Zhang, Xiongwei
    Sun, Meng
    ETRI JOURNAL, 2017, 39 (01) : 21 - 29
  • [2] Single Channel Music and Speech Separation Using Non-negative Matrix Factorization
    Yidirim, Sinan
    Saraclar, Murat
    2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 543 - 546
  • [3] DISCRIMINATIVE NON-NEGATIVE MATRIX FACTORIZATION FOR SINGLE-CHANNEL SPEECH SEPARATION
    Wang, Zi
    Sha, Fei
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] Regularized non-negative matrix factorization with Gaussian mixtures and masking model for speech enhancement
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    SPEECH COMMUNICATION, 2017, 87 : 18 - 30
  • [5] SOURCE SEPARATION WITH SCATTERING NON-NEGATIVE MATRIX FACTORIZATION
    Bruna, Joan
    Sprechmann, Pablo
    LeCun, Yann
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1876 - 1880
  • [6] Source Separation Based on Non-Negative Matrix Factorization of the Synchrosqueezing Transform
    Singh, Neha
    Meignen, Sylvain
    Oberlin, Thomas
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1910 - 1914
  • [7] Vocal Separation by Constrained Non-Negative Matrix Factorization
    Ochiai, Eri
    Fujisawa, Takanori
    Ikehara, Masaaki
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 480 - 483
  • [8] Performance Evaluation of Single Channel Speech Separation Using Non-Negative Matrix Factorization
    Nandakumar, Mona M.
    Bijoy, Edet K.
    2014 NATIONAL CONFERENCE ON COMMUNICATION, SIGNAL PROCESSING AND NETWORKING (NCCSN), 2014,
  • [9] JOINT SEPARATION AND DEREVERBERATION OF REVERBERANT MIXTURES WITH DETERMINED MULTICHANNEL NON-NEGATIVE MATRIX FACTORIZATION
    Kagami, Hideaki
    Kameoka, Hirokazu
    Yukawa, Masahiro
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 31 - 35
  • [10] Single-Channel Speech Separation using Sparse Non-Negative Matrix Factorization
    Schmidt, Mikkel N.
    Olsson, Rasmus K.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2614 - 2617