A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION

被引:0
|
作者
Laroche, Clement [1 ,2 ]
Kowalski, Matthieu [2 ,3 ]
Papadopoulos, Helene [2 ]
Richard, Gael [1 ]
机构
[1] Telecom ParisTech, Inst Mines Telecom, CNRS LTCE, Paris, France
[2] Univ Paris 11, CNRS, Cent Supelec, L2S, Gif Sur Yvette, France
[3] CEA Saclay, INRIA, Parietal Project Team, F-91191 Gif Sur Yvette, France
关键词
nonnegative matrix factorization; projective nonnegative matrix factorization; audio source separation; harmonic/percussive decomposition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a new unconstrained nonnegative matrix factorization method designed to utilize the multilayer structure of audio signals to improve the quality of the source separation. The tonal layer is sparse in frequency and temporally stable, while the transient layer is composed of short term broadband sounds. Our method has a part well suited for tonal extraction which decomposes the signals in sparse orthogonal components, while the transient part is represented by a regular nonnegative matrix factorization decomposition. Experiments on synthetic and real music data in a source separation context show that such decomposition is suitable for audio signal. Compared with three state-of-the-art harmonic/percussive decomposition algorithms, the proposed method shows competitive performances.
引用
收藏
页码:2033 / 2037
页数:5
相关论文
共 50 条
  • [41] Transductive Convolutive Nonnegative Matrix Factorization for Speech Separation
    Mai, Yaodan
    Lan, Long
    Guan, Naiyang
    Zhang, Xiang
    Luo, Zhigang
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1400 - 1404
  • [42] Orthogonal Nonnegative Matrix Factorization for Blind Image Separation
    Mirzal, Andri
    [J]. ADVANCES IN VISUAL INFORMATICS, 2013, 8237 : 25 - 35
  • [43] NONNEGATIVE TENSOR FACTORIZATION FOR SOURCE SEPARATION OF LOOPS IN AUDIO
    Smith, Jordan B. L.
    Goto, Masataka
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 171 - 175
  • [44] Deep Transductive Nonnegative Matrix Factorization for Speech Separation
    Liu, Yalin
    Guan, Naiyang
    Liu, Jie
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 249 - 254
  • [45] Discriminative separable nonnegative matrix factorization by structured sparse regularization
    Wang, Shengzheng
    Peng, Jing
    Liu, Wei
    [J]. SIGNAL PROCESSING, 2016, 120 : 620 - 626
  • [46] Structured Convex Optimization Method for Orthogonal Nonnegative Matrix Factorization
    Pan, Junjun
    Ng, Michael K.
    Zhang, Xiongjun
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 459 - 464
  • [47] Robust Structured Convex Nonnegative Matrix Factorization for Data Representation
    Yang, Qing
    Yin, Xuesong
    Kou, Simin
    Wang, Yigang
    [J]. IEEE ACCESS, 2021, 9 : 155087 - 155102
  • [48] Parallel multichannel blind source separation using a spatial covariance model and nonnegative matrix factorization
    A. J. Muñoz-Montoro
    J. J. Carabias-Orti
    R. Cortina
    S. García-Galán
    J. Ranilla
    [J]. The Journal of Supercomputing, 2021, 77 : 12143 - 12156
  • [49] Parallel multichannel blind source separation using a spatial covariance model and nonnegative matrix factorization
    Munoz-Montoro, A. J.
    Carabias-Orti, J. J.
    Cortina, R.
    Garcia-Galan, S.
    Ranilla, J.
    [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (10): : 12143 - 12156
  • [50] REVERBERANT AUDIO SOURCE SEPARATION USING PARTIALLY PRE-TRAINED NONNEGATIVE MATRIX FACTORIZATION
    Fakhry, Mahmoud
    Svaizer, Piergiorgio
    Omologo, Maurizio
    [J]. 2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 273 - 277