Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation

被引:2
|
作者
Laroche, Clement [1 ,2 ]
Kowalski, Matthieu [2 ]
Papadopoulos, Helene [2 ]
Richard, Gael [1 ]
机构
[1] Univ Paris Saclay, Telecom ParisTech, LTCI, F-75013 Paris, France
[2] Univ Paris Sud, Cent Supelec, CNRS, UMR 8506,Lab Signaux & Syst, F-91192 Gif Sur Yvette, France
关键词
Nonnegative matrix factorization; projective nonnegative matrix factorization; audio source separation; harmonic/percussive decomposition; POLYPHONIC MUSIC; MELODY EXTRACTION; SPEECH SIGNALS; TRANSCRIPTION; DECOMPOSITION; ALGORITHMS;
D O I
10.1109/TASLP.2018.2830116
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
One of the most general models of music signals considers that such signals can be represented as a sum of two distinct components: a tonal part that is sparse in frequency and temporally stable and a transient (or percussive) part that is composed of short-term broadband sounds. In this paper, we propose a novel hybrid method built upon nonnegative matrix factorization (NMF) that decomposes the time frequency representation of an audio signal into such two components. The tonal part is estimated by a sparse and orthogonal nonnegative decomposition, and the transient part is estimated by a straightforward NMF decomposition constrained by a pre-learned dictionary of smooth spectra. The optimization problem at the heart of our method remains simple with very few hyperparameters and can be solved thanks to simple multiplicative update rules. The extensive benchmark on a large and varied music database against four state of the art harmonic/percussive source separation algorithms demonstrate the merit of the proposed approach.
引用
收藏
页码:1499 / 1511
页数:13
相关论文
共 50 条
  • [1] Initialization of Nonnegative Matrix Factorization Dictionaries for Single Channel Source Separation
    Grais, Emad M.
    Erdogan, Hakan
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [2] NONNEGATIVE MATRIX PARTIAL CO-FACTORIZATION FOR DRUM SOURCE SEPARATION
    Yoo, Jiho
    Kim, Minje
    Kang, Kyeongok
    Choi, Seungjin
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1942 - 1945
  • [3] Nonnegative Matrix Partial Co-Factorization for Spectral and Temporal Drum Source Separation
    Kim, Minje
    Yoo, Jiho
    Kang, Kyeongok
    Choi, Seungjin
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (06) : 1192 - 1204
  • [4] Audio Source Separation Based on Nonnegative Matrix Factorization with Graph Harmonic Structure
    Ichita, Tomohiro
    Kyochi, Seisuke
    Imoto, Keisuke
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1148 - 1152
  • [5] A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2033 - 2037
  • [6] Toward Finding Optimal Source Dictionaries for Single Channel Music Source Separation Using Nonnegative Matrix Factorization
    Rathnayake, Bhathiya
    Weerakoon, K. M. K.
    Godaliyadda, G. M. R., I
    Ekanayake, M. P. B.
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 1493 - 1500
  • [7] Projective Nonnegative Matrix Factorization with α-Divergence
    Yang, Zhirong
    Oja, Erkki
    ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT I, 2009, 5768 : 20 - 29
  • [8] A PROJECTIVE APPROACH TO NONNEGATIVE MATRIX FACTORIZATION
    Groetzner, Patrick
    ELECTRONIC JOURNAL OF LINEAR ALGEBRA, 2021, 37 : 583 - 597
  • [9] Linear and Nonlinear Projective Nonnegative Matrix Factorization
    Yang, Zhirong
    Oja, Erkki
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (05): : 734 - 749
  • [10] Global Minimization of the Projective Nonnegative Matrix Factorization
    Yuan, Zhijian
    ADVANCES IN NEURO-INFORMATION PROCESSING, PT II, 2009, 5507 : 987 - 993