Mixtures of Gamma Priors for Non-negative Matrix Factorization Based Speech Separation

Cited by: 0
Authors
Virtanen, Tuomas [1 ]
Cemgil, Ali Taylan [2 ]
Affiliations
[1] Tampere Univ Technol, Korkeakoulunkatu 1, FI-33720 Tampere, Finland
[2] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
This paper deals with audio source separation using supervised non-negative matrix factorization (NMF). We propose a prior model based on mixtures of Gamma distributions for each sound class, whose hyperparameters are trained on a training corpus. This formulation allows the spectral basis vectors of the sound sources to be adapted during actual operation, when the exact characteristics of the sources are not known in advance. Simulations were conducted using a random mixture of two speakers. Even without adaptation, the mixture model outperformed the basic NMF, and adaptation slightly further improved the separation quality. Audio demonstrations are available at www.cs.tut.fi/~tuomasv.
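For orientation, the "basic NMF" baseline mentioned in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses standard multiplicative updates for the generalized Kullback-Leibler divergence, it does not include the Gamma mixture prior or the basis adaptation that the paper proposes, and the function names `kl_nmf` and `separate`, the component counts, and the soft-mask reconstruction are hypothetical choices made for this example.

```python
import numpy as np


def kl_nmf(V, n_components=None, W=None, n_iter=200, seed=0, eps=1e-12):
    """NMF with multiplicative updates for the generalized KL divergence.

    V is a non-negative (frequency x time) magnitude spectrogram.
    If W is given, it is held fixed and only the activations H are estimated
    (the supervised case); otherwise both W and H are learned.
    """
    rng = np.random.default_rng(seed)
    update_W = W is None
    if update_W:
        W = rng.random((V.shape[0], n_components)) + eps
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        R = V / (W @ H + eps)                       # element-wise ratio V / (WH)
        H *= (W.T @ R) / (W.sum(axis=0)[:, None] + eps)
        if update_W:
            R = V / (W @ H + eps)
            W *= (R @ H.T) / (H.sum(axis=1)[None, :] + eps)
    return W, H


def separate(V_mix, W1, W2, n_iter=200, eps=1e-12):
    """Separate a two-source mixture using fixed, pre-trained bases.

    Assumption: each source is reconstructed with a soft (Wiener-like) mask
    built from the two model spectrograms; this is one common choice only.
    """
    W = np.hstack([W1, W2])
    _, H = kl_nmf(V_mix, W=W, n_iter=n_iter)
    k = W1.shape[1]
    S1 = W1 @ H[:k]                                 # model of source 1
    S2 = W2 @ H[k:]                                 # model of source 2
    mask = S1 / (S1 + S2 + eps)
    return mask * V_mix, (1.0 - mask) * V_mix
```

In this sketch, training amounts to calling `kl_nmf` on each speaker's training spectrogram to obtain `W1` and `W2`, and separation calls `separate` on the mixture spectrogram with those bases held fixed. Because the bases are fixed at test time, a baseline of this kind cannot adjust to sources whose characteristics differ from the training data, which is the limitation the abstract's prior model is meant to address.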
Pages: 646 / +
Number of pages: 2
Related papers
50 items in total
  • [31] Image fusion based on non-negative matrix factorization
    Zhang, JY
    Wei, L
    Miao, QG
    Wang, Y
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 973 - 976
  • [32] Regularized Non-negative Matrix Factorization with Temporal Dependencies for Speech Denoising
    Wilson, Kevin W.
    Raj, Bhiksha
    Smaragdis, Paris
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 411 - +
  • [33] Speech recognition in mixed sound of speech and music based on vector quantization and non-negative matrix factorization
    Nakano, Shoichi
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1792 - 1795
  • [34] BASIS COMPENSATION IN NON-NEGATIVE MATRIX FACTORIZATION MODEL FOR SPEECH ENHANCEMENT
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2249 - 2253
  • [35] SPEECH EMOTION RECOGNITION USING TRANSFER NON-NEGATIVE MATRIX FACTORIZATION
    Song, Peng
    Ou, Shifeng
    Zheng, Wenming
    Jin, Yun
    Zhao, Li
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5180 - 5184
  • [36] IMAGE PREDICTION BASED ON NON-NEGATIVE MATRIX FACTORIZATION
    Turkan, Mehmet
    Guillemot, Christine
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 789 - 792
  • [37] Link prediction based on non-negative matrix factorization
    Chen, Bolun
    Li, Fenfen
    Chen, Senbo
    Hu, Ronglin
    Chen, Ling
    PLOS ONE, 2017, 12 (08):
  • [38] Analysis of compatibility of dyes in mixtures by means of non-negative matrix factorization
    Charmi, Fatemeh
    Amirshahi, Seyed Hossein
    COLOR RESEARCH AND APPLICATION, 2021, 46 (05): : 1057 - 1065
  • [39] Monaural noisy speech separation combining sparse non-negative matrix factorization and deep attractor network
    Ge, Wanying
    Zhang, Tianqi
    Fan, Congcong
    Zhang, Tian
    Chinese Journal of Acoustics, 2021, 40 (02) : 266 - 280
  • [40] Monaural noisy speech separation combining sparse non-negative matrix factorization and deep attractor network
    Ge, Wanying
    Zhang, Tianqi
    Fan, Congcong
    Zhang, Tian
    Shengxue Xuebao/Acta Acustica, 2021, 46 (01): : 55 - 66