SHIFTED AND CONVOLUTIVE SOURCE-FILTER NON-NEGATIVE MATRIX FACTORIZATION FOR MONAURAL AUDIO SOURCE SEPARATION

被引:0
|
作者
Nakamura, Tomohiko [1 ]
Kameoka, Hirokazu [1 ,2 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138656, Japan
[2] NTT Corp, NTT Commun Sci Labs, 3-1 Morinosato Wakamiya, Atsugi, Kanagawa 2430198, Japan
关键词
Audio source separation; Shifted non-negative matrix factorization; Shift-invariant probabilistic latent component analysis; Source-filter theory; SPARSE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes an extension of non-negative matrix factorization (NMF), which combines the shifted NMF model with the source-filter model. Shifted NMF was proposed as a powerful approach for monaural source separation and multiple fundamental frequency (F-0) estimation, which is particularly unique in that it takes account of the constant inter-harmonic spacings of a harmonic structure in log-frequency representations and uses a shifted copy of a spectrum template to represent the spectra of different F(0)s. However, for those sounds that follow the source-filter model, this assumption does not hold in reality, since the filter spectra are usually invariant under F-0 changes. A more reasonable way to represent the spectrum of a different F-0 is to use a shifted copy of a harmonic structure template as the excitation spectrum and keep the filter spectrum fixed. Thus, we can describe the spectrogram of a mixture signal as the sum of the products between the shifted copies of excitation spectrum templates and filter spectrum templates. Furthermore, the time course of filter spectra represents the dynamics of the timbre, which is important for characterizing the feature of an instrument sound. Thus, we further incorporate the non-negative matrix factor deconvolution (NMFD) model into the above model to describe the filter spectrogram. We derive a computationally efficient and convergence-guaranteed algorithm for estimating the unknown parameters of the constructed model based on the auxiliary function approach. Experimental results revealed that the proposed method outperformed shifted NMF in terms of the source separation accuracy.
引用
收藏
页码:489 / 493
页数:5
相关论文
共 50 条
  • [1] AN INTERACTIVE AUDIO SOURCE SEPARATION FRAMEWORK BASED ON NON-NEGATIVE MATRIX FACTORIZATION
    Duong, Ngoc Q. K.
    Ozerov, Alexey
    Chevallier, Louis
    Sirot, Joel
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [2] SOURCE SEPARATION WITH SCATTERING NON-NEGATIVE MATRIX FACTORIZATION
    Bruna, Joan
    Sprechmann, Pablo
    LeCun, Yann
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1876 - 1880
  • [3] Perceptual Single-Channel Audio Source Separation by Non-negative Matrix Factorization
    Kirbiz, Serap
    Gunsel, Bilge
    2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 654 - 657
  • [4] Shifted non-negative matrix factorisation for sound source separation
    FitzGerald, Derry
    Cranitch, Matt
    Coyle, Eugene
    2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 1061 - 1065
  • [5] BLIND AUDIO SOURCE SEPARATION OF STEREO MIXTURES USING BAYESIAN NON-NEGATIVE MATRIX FACTORIZATION
    Mirzaei, S.
    Van Hamme, H.
    Norouzi, Y.
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 621 - 625
  • [6] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
    Ozerov, Alexey
    Fevotte, Cedric
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563
  • [7] ADAPTATION OF SOURCE-SPECIFIC DICTIONARIES IN NON-NEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION
    Jaureguiberry, Xabier
    Leveau, Pierre
    Maller, Simon
    Burred, Juan Jose
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5 - 8
  • [8] Multi-source separation based on non-negative matrix factorization and source distribution
    Jia, Xinyu
    Jia, Maoshen
    Gao, Shang
    Zhang, Yu
    2021 IMMERSIVE AND 3D AUDIO: FROM ARCHITECTURE TO AUTOMOTIVE (I3DA), 2021,
  • [9] Source Separation Based on Non-Negative Matrix Factorization of the Synchrosqueezing Transform
    Singh, Neha
    Meignen, Sylvain
    Oberlin, Thomas
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1910 - 1914
  • [10] Blind source separation with optimal transport non-negative matrix factorization
    Antoine Rolet
    Vivien Seguy
    Mathieu Blondel
    Hiroshi Sawada
    EURASIP Journal on Advances in Signal Processing, 2018