Initialization of Nonnegative Matrix Factorization Dictionaries for Single Channel Source Separation

被引:0
|
作者
Grais, Emad M. [1 ]
Erdogan, Hakan [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, TR-34956 Istanbul, Turkey
关键词
Nonnegative matrix factorization; single channel source separation; dictionary learning; fuzzy clustering; principal component analysis; data clustering;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this work, we study different initialization methods for the nonnegative matrix factorization (NMF) dictionaries or bases. There is a need for good initializations for NMF dictionary because NMF decomposition is a non-convex problem which has many local minima. The effect of the initialization of NMF is evaluated in this work on audio source separation applications. In supervised audio source separation, NMF is used to train a set of basis vectors (basis matrix) for each source in an iterative fashion. Then NMF is used to decompose the mixed signal spectrogram as a weighted linear combination of the trained basis vectors for all sources in the mixed signal. The estimate for each source is computed by summing the decomposition terms that include its corresponding trained bases. In this work, we use principal component analysis (PCA), spherical K-means, and fuzzy C-means (FCM) to initialize the NMF basis matrices during the training procedures. Experimental results show that, better initialization for NMF bases gives better audio separation performance than using NMF with random initialization.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Toward Finding Optimal Source Dictionaries for Single Channel Music Source Separation Using Nonnegative Matrix Factorization
    Rathnayake, Bhathiya
    Weerakoon, K. M. K.
    Godaliyadda, G. M. R., I
    Ekanayake, M. P. B.
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 1493 - 1500
  • [2] Sequential Initialization of Multichannel Nonnegative Matrix Factorization for Sound Source Separation
    Uramoto, Takanobu
    Tachioka, Yuuki
    Narita, Tomohiro
    Miura, Iori
    Uenohara, Shingo
    Furuya, Ken'ichi
    2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2017,
  • [3] Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1499 - 1511
  • [4] Nonnegative matrix factorization 2D with the flexible β-Divergence for Single Channel Source Separation
    Yu, Kaiwen
    Woo, W. L.
    Dlay, S. S.
    2015 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2015), 2015,
  • [5] SINGLE CHANNEL SOURCE SEPARATION USING SMOOTH NONNEGATIVE MATRIX FACTORIZATION WITH MARKOV RANDOM FIELDS
    Kim, Minje
    Smaragdis, Paris
    2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2013,
  • [6] Nonnegative Matrix Factorization with Disjointness Constraints for Single Channel Speech Separation
    Huang, Jianjun
    Zhang, Xiongwei
    Zhang, Yafei
    Wu, Haijia
    PROCEEDINGS OF 2012 IEEE 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, 2012, : 1149 - 1153
  • [7] A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2033 - 2037
  • [8] Hidden Markov Models as Priors for Regularized Nonnegative Matrix Factorization in Single-Channel Source Separation
    Grais, Emad M.
    Erdogan, Hakan
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1534 - 1537
  • [9] Gaussian Mixture Gain Priors for Regularized Nonnegative Matrix Factorization in Single-Channel Source Separation
    Grais, Emad M.
    Erdogan, Hakan
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1518 - 1521
  • [10] Regularized nonnegative matrix factorization using Gaussian mixture priors for supervised single channel source separation
    Grais, Emad M.
    Erdogan, Hakan
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03): : 746 - 762