Nonnegative Matrix Factorization With Basis Clustering Using Cepstral Distance Regularization

被引:5
|
作者
Kameoka, Hirokazu [1 ]
Higuchi, Takuya [1 ]
Tanaka, Mikihiro [2 ]
Li, Li [3 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Tokyo 2430198, Japan
[2] Univ Tokyo, Tokyo 1138656, Japan
[3] Univ Tsukuba, Tsukuba, Ibaraki 3058577, Japan
关键词
Audio source separation; nonnegative matrix factorization (NMF); basis clustering; mel-frequency cepstral coefficient (MFCC); majorization-minimization algorithm; MODEL; NMF;
D O I
10.1109/TASLP.2018.2795746
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
One successful approach for audio source separation involves applying nonnegative matrix factorization (NMF) to a magnitude spectrogram regarded as a nonnegative matrix. This can be interpreted as approximating the observed spectra at each time frame as the linear sum of the basis spectra scaled by time-varying amplitudes. This paper deals with the problem of the unsupervised instrument-wise source separation of polyphonic signals based on an extension of the NMF approach. We focus on the fact that each piece of music is typically played on a handful of musical instruments, which allows us to assume that the spectra of the underlying audio events in a polyphonic signal can be grouped into a reasonably small number of clusters in the mel-frequency cepstral coefficient (MFCC) domain. Based on this assumption, we propose formulating factorization of amagnitude spectrogram and clustering of the basis spectra in the MFCC domain as a joint optimization problem and derive a novel optimization algorithm based on the majorization-minimization principle. Experimental results revealed that our method was superior to a two-stage algorithm that consists of performing factorization followed by clustering the basis spectra, thus showing the advantage of the joint optimization approach.
引用
收藏
页码:1025 / 1036
页数:12
相关论文
共 50 条
  • [41] Deep asymmetric nonnegative matrix factorization for graph clustering
    Hajiveiseh, Akram
    Seyedi, Seyed Amjad
    Tab, Fardin Akhlaghian
    PATTERN RECOGNITION, 2024, 148
  • [42] Robust graph regularized nonnegative matrix factorization for clustering
    Huang, Shudong
    Wang, Hongjun
    Li, Tao
    Li, Tianrui
    Xu, Zenglin
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (02) : 483 - 503
  • [43] Subspace clustering guided convex nonnegative matrix factorization
    Cui, Guosheng
    Li, Xuelong
    Dong, Yongsheng
    NEUROCOMPUTING, 2018, 292 : 38 - 48
  • [44] Robust graph regularized nonnegative matrix factorization for clustering
    Shudong Huang
    Hongjun Wang
    Tao Li
    Tianrui Li
    Zenglin Xu
    Data Mining and Knowledge Discovery, 2018, 32 : 483 - 503
  • [45] Weighted Nonnegative Matrix Factorization for Image Inpainting and Clustering
    Xiangguang Dai
    Nian Zhang
    Keke Zhang
    Jiang Xiong
    International Journal of Computational Intelligence Systems, 2020, 13 : 734 - 743
  • [46] Weighted Nonnegative Matrix Factorization for Image Inpainting and Clustering
    Dai, Xiangguang
    Zhang, Nian
    Zhang, Keke
    Xiong, Jiang
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 734 - 743
  • [47] Max-min distance nonnegative matrix factorization
    Wang, Jim Jing-Yan
    Gao, Xin
    NEURAL NETWORKS, 2015, 61 : 75 - 84
  • [48] Robust Graph Regularized Nonnegative Matrix Factorization for Clustering
    Peng, Chong
    Kang, Zhao
    Hu, Yunhong
    Cheng, Jie
    Cheng, Qiang
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2017, 11 (03)
  • [49] Speech source separation avoiding initial value dependency by cepstral-basis-decom-posed nonnegative matrix factorization
    Oshima, Fuga
    Nakayama, Masashi
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2023, 44 (02) : 137 - 140
  • [50] Nonnegative Matrix Factorization Using Nonnegative Polynomial Approximations
    Debals, Otto
    Van Barel, Marc
    De Lathauwer, Lieven
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (07) : 948 - 952