NONNEGATIVE TENSOR FACTORIZATION FOR SOURCE SEPARATION OF LOOPS IN AUDIO

被引:0
|
作者
Smith, Jordan B. L. [1 ]
Goto, Masataka [1 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan
关键词
nonnegative tensor factorization; source separation; loop-based music; repetition; DECONVOLUTION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The prevalence of exact repetition in loop-based music makes it an opportune target for source separation. Nonnegative factorization approaches have been used to model the repetition of looped content, and kernel additive modeling has leveraged periodicity within a piece to separate looped background elements. We propose a novel method of leveraging periodicity in a factorization model: we treat the two-dimensional spectrogram as a three-dimensional tensor, and use nonnegative tensor factorization to estimate the component spectral templates, rhythms and loop recurrences in a single step. Testing our method on synthesized loop-based examples, we find that our algorithm mostly exceeds the performance of competing methods, with a reduction in execution cost. We discuss limitations of the algorithm as we demonstrate its potential to analyze larger and more complex songs.
引用
收藏
页码:171 / 175
页数:5
相关论文
共 50 条
  • [1] CORRELATED TENSOR FACTORIZATION FOR AUDIO SOURCE SEPARATION
    Yoshii, Kazuyoshi
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 731 - 735
  • [2] MULTICHANNEL NONNEGATIVE TENSOR FACTORIZATION WITH STRUCTURED CONSTRAINTS FOR USER-GUIDED AUDIO SOURCE SEPARATION
    Ozerov, Alexey
    Fevotte, Cedric
    Blouet, Raphael
    Durrieu, Jean-Louis
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 257 - 260
  • [3] STUDENT'S T NONNEGATIVE MATRIX FACTORIZATION AND POSITIVE SEMIDEFINITE TENSOR FACTORIZATION FOR SINGLE-CHANNEL AUDIO SOURCE SEPARATION
    Yoshii, Kazuyoshi
    Itoyama, Katsutoshi
    Goto, Masataka
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 51 - 55
  • [4] Blind Separation of Audio Mixtures Through Nonnegative Tensor Factorization of Modulation Spectrograms
    Barker, Tom
    Virtanen, Tuomas
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2377 - 2389
  • [5] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
    Ozerov, Alexey
    Fevotte, Cedric
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563
  • [6] BAYESIAN MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR AUDIO SOURCE SEPARATION AND LOCALIZATION
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 551 - 555
  • [7] Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (01) : 43 - 46
  • [8] Audio Source Separation Based on Nonnegative Matrix Factorization with Graph Harmonic Structure
    Ichita, Tomohiro
    Kyochi, Seisuke
    Imoto, Keisuke
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1148 - 1152
  • [9] Underdetermined Blind Source Separation Combining Tensor Decomposition and Nonnegative Matrix Factorization
    Xie, Yuan
    Xie, Kan
    Yang, Junjie
    Xie, Shengli
    [J]. SYMMETRY-BASEL, 2018, 10 (10):
  • [10] Coding-Based Informed Source Separation: Nonnegative Tensor Factorization Approach
    Ozerov, Alexey
    Liutkus, Antoine
    Badeau, Roland
    Richard, Gael
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (08): : 1699 - 1712