Initialization of Nonnegative Matrix Factorization Dictionaries for Single Channel Source Separation

被引:0
|
作者
Grais, Emad M. [1 ]
Erdogan, Hakan [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, TR-34956 Istanbul, Turkey
关键词
Nonnegative matrix factorization; single channel source separation; dictionary learning; fuzzy clustering; principal component analysis; data clustering;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this work, we study different initialization methods for the nonnegative matrix factorization (NMF) dictionaries or bases. There is a need for good initializations for NMF dictionary because NMF decomposition is a non-convex problem which has many local minima. The effect of the initialization of NMF is evaluated in this work on audio source separation applications. In supervised audio source separation, NMF is used to train a set of basis vectors (basis matrix) for each source in an iterative fashion. Then NMF is used to decompose the mixed signal spectrogram as a weighted linear combination of the trained basis vectors for all sources in the mixed signal. The estimate for each source is computed by summing the decomposition terms that include its corresponding trained bases. In this work, we use principal component analysis (PCA), spherical K-means, and fuzzy C-means (FCM) to initialize the NMF basis matrices during the training procedures. Experimental results show that, better initialization for NMF bases gives better audio separation performance than using NMF with random initialization.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Audio Source Separation Based on Nonnegative Matrix Factorization with Graph Harmonic Structure
    Ichita, Tomohiro
    Kyochi, Seisuke
    Imoto, Keisuke
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1148 - 1152
  • [42] Dual-Transform Source Separation Using Sparse Nonnegative Matrix Factorization
    Md. Imran Hossain
    Md. Shohidul Islam
    Mst. Titasa Khatun
    Rizwan Ullah
    Asim Masood
    Zhongfu Ye
    Circuits, Systems, and Signal Processing, 2021, 40 : 1868 - 1891
  • [43] Layered Nonnegative Matrix Factorization for Speech Separation
    Hsu, Chung-Chien
    Chien, Jen-Tzung
    Chi, Tai-Shih
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 628 - 632
  • [44] Nonnegative matrix factor 2-D deconvolution for blind single channel source separation
    Schmidt, MN
    Morup, M
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, PROCEEDINGS, 2006, 3889 : 700 - 707
  • [45] Initialization of nonnegative matrix factorization by Gaussian primaries for reconstruction of spectral data
    Farajikhah, Syamak
    Amirshahi, Seyed Hossein
    OPTICAL REVIEW, 2012, 19 (05) : 294 - 305
  • [46] Initialization of nonnegative matrix factorization by Gaussian primaries for reconstruction of spectral data
    Syamak Farajikhah
    Seyed Hossein Amirshahi
    Optical Review, 2012, 19 : 294 - 305
  • [47] β-Divergence Two-Dimensional Sparse Nonnegative Matrix Factorization for Audio Source Separation
    Darsono, A. M.
    Haron, N. Z.
    Jaafar, A. S.
    Ahmad, M. I.
    2013 IEEE CONFERENCE ON WIRELESS SENSOR (ICWISE), 2013, : 119 - 123
  • [48] The Source Separation of Multi-channel Vibration Signal Based on Nonnegative Tensor Factorization
    Li, Guang
    Liang, Lin
    Liu, Dan
    Li, Maolin
    Wang, Bao
    Xu, Guanghua
    2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS (ICCCAS 2018), 2018, : 359 - 363
  • [49] Perceptual Single-Channel Audio Source Separation by Non-negative Matrix Factorization
    Kirbiz, Serap
    Gunsel, Bilge
    2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 654 - 657
  • [50] Adaptive Sparsity Non-Negative Matrix Factorization for Single-Channel Source Separation
    Gao, Bin
    Woo, W. L.
    Dlay, S. S.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (05) : 989 - 1001