SHIFTED AND CONVOLUTIVE SOURCE-FILTER NON-NEGATIVE MATRIX FACTORIZATION FOR MONAURAL AUDIO SOURCE SEPARATION

被引:0
|
作者
Nakamura, Tomohiko [1 ]
Kameoka, Hirokazu [1 ,2 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138656, Japan
[2] NTT Corp, NTT Commun Sci Labs, 3-1 Morinosato Wakamiya, Atsugi, Kanagawa 2430198, Japan
关键词
Audio source separation; Shifted non-negative matrix factorization; Shift-invariant probabilistic latent component analysis; Source-filter theory; SPARSE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes an extension of non-negative matrix factorization (NMF), which combines the shifted NMF model with the source-filter model. Shifted NMF was proposed as a powerful approach for monaural source separation and multiple fundamental frequency (F-0) estimation, which is particularly unique in that it takes account of the constant inter-harmonic spacings of a harmonic structure in log-frequency representations and uses a shifted copy of a spectrum template to represent the spectra of different F(0)s. However, for those sounds that follow the source-filter model, this assumption does not hold in reality, since the filter spectra are usually invariant under F-0 changes. A more reasonable way to represent the spectrum of a different F-0 is to use a shifted copy of a harmonic structure template as the excitation spectrum and keep the filter spectrum fixed. Thus, we can describe the spectrogram of a mixture signal as the sum of the products between the shifted copies of excitation spectrum templates and filter spectrum templates. Furthermore, the time course of filter spectra represents the dynamics of the timbre, which is important for characterizing the feature of an instrument sound. Thus, we further incorporate the non-negative matrix factor deconvolution (NMFD) model into the above model to describe the filter spectrogram. We derive a computationally efficient and convergence-guaranteed algorithm for estimating the unknown parameters of the constructed model based on the auxiliary function approach. Experimental results revealed that the proposed method outperformed shifted NMF in terms of the source separation accuracy.
引用
收藏
页码:489 / 493
页数:5
相关论文
共 50 条
  • [41] Perceptually enhanced blind single-channel music source separation by Non-negative Matrix Factorization
    Kirbiz, S.
    Gunsel, B.
    DIGITAL SIGNAL PROCESSING, 2013, 23 (02) : 646 - 658
  • [42] Multichannel non-negative matrix factorisation based on alternating least squares for audio source separation system
    Lee, Seokjin
    Pang, Hee-Suk
    ELECTRONICS LETTERS, 2015, 51 (03) : 197 - 198
  • [43] Monaural noisy speech separation combining sparse non-negative matrix factorization and deep attractor network
    GE Wanying
    ZHANG Tianqi
    FAN Congcong
    ZHANG Tian
    ChineseJournalofAcoustics, 2021, 40 (02) : 266 - 280
  • [44] Monaural noisy speech separation combining sparse non-negative matrix factorization and deep attractor network
    Ge, Wanying
    Zhang, Tianqi
    Fan, Congcong
    Zhang, Tian
    Shengxue Xuebao/Acta Acustica, 2021, 46 (01): : 55 - 66
  • [45] ADAPTIVE CODING OF NON-NEGATIVE FACTORIZATION PARAMETERS WITH APPLICATION TO INFORMED SOURCE SEPARATION
    Blaeser, Max
    Rohlfing, Christian
    Gao, Yingbo
    Wien, Mathias
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 751 - 755
  • [46] Non-negative matrix factorization filter for digital deconvolution
    Griffin, Scott
    Biechele-Speziale, John
    Smith, Casey
    Ximeng You-dow
    Simpson, Garth
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [47] Comparison of Convolutive Kernel Compensation and Non-Negative Matrix Factorization of Surface Electromyograms
    Savc, Martin
    Glaser, Vojko
    Kranjec, Jernej
    Cikajlo, Imre
    Matjacic, Zlatko
    Holobar, Ales
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2018, 26 (10) : 1935 - 1944
  • [48] Non-Negative Matrix Factorization Algorithms for Blind Source Sepertion in Speech Recognition
    Kumar, Santosh S.
    Bharathi, S. H.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 2242 - 2246
  • [49] On Audio Enhancement via Online Non-Negative Matrix Factorization
    Sack, Andrew
    Jiang, Wenzhao
    Perlmutter, Michael
    Salanevich, Palina
    Needell, Deanna
    2022 56TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2022, : 287 - 291
  • [50] The Voice Conversion Method Based on Sparse Convolutive Non-negative Matrix Factorization
    Zhang, Qianmin
    Tao, Liang
    Zhou, Jian
    Wang, Huabin
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTRICAL AND INFORMATION TECHNOLOGIES FOR RAIL TRANSPORTATION: TRANSPORTATION, 2016, 378 : 259 - 267