Stereophonic Music Separation Based on Non-negative Tensor Factorization with Cepstrum Regularization

被引:0
|
作者
Seki, Shogo [1 ]
Toda, Tomoki [2 ]
Takeda, Kazuya [1 ]
机构
[1] Nagoya Univ, Grad Sch Informat Sci, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan
[2] Nagoya Univ, Ctr Informat Technol, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan
关键词
AUDIO SOURCE SEPARATION; MATRIX FACTORIZATION; MIXTURES;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a novel approach to stereophonic music separation based on Non-negative Tensor Factorization (NTF). Stereophonic music is roughly divided into two types; recorded music or synthesized music, which we focus on synthesized one in this paper. Synthesized music signals are often generated as linear combinations of many individual source signals with their mixing gains (i.e., time-invariant amplitude scaling) to each channel signal. Therefore, the synthesized stereophonic music separation is the underdetermined source separation problem where phase components are not helpful for the separation. NTF is one of the effective techniques to handle this problem, decomposing amplitude spectrograms of the stereo channel music signal into basis vectors and activations of individual music source signals and their corresponding mixing gains. However, it is essentially difficult to obtain sufficient separation performance in this separation problem as available acoustic cues for separation are limited. To address this issue, we propose a cepstrum regularization method for NTF-based stereo channel separation. The proposed method makes the separated music source signals follow the corresponding Gaussian mixture models of individual music source signals, which are trained in advance using their available samples. An experimental evaluation using real music signals is conducted to investigate the effectiveness of the proposed method in both supervised and unsupervised separation frameworks. The experimental results demonstrate that the proposed method yields significant improvements in separation performance in both frameworks.
引用
收藏
页码:981 / 985
页数:5
相关论文
共 50 条
  • [41] Non-Negative Matrix Factorization Based Compensation of Music for Automatic Speech Recognition
    Raj, Bhiksha
    Virtanen, Tuomas
    Chaudhuri, Sourish
    Singh, Rita
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 717 - +
  • [42] Singing Voice Separation for Mono-Channel Music Using Non-negative Matrix Factorization
    Chanrungutai, Angkana
    Ratanamahatana, Chotirat Ann
    2008 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS, PROCEEDINGS, 2008, : 247 - 250
  • [43] AN ADAPTIVE TIME-FREQUENCY RESOLUTION FRAMEWORK FOR SINGLE CHANNEL SOURCE SEPARATION BASED ON NON-NEGATIVE TENSOR FACTORIZATION
    Kirbiz, S.
    Gunsel, B.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 905 - 909
  • [44] Advancing Non-Negative Latent Factorization of Tensors With Diversified Regularization Schemes
    Wu, Hao
    Luo, Xin
    Zhou, Mengchu
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (03) : 1334 - 1344
  • [45] Non-negative matrix factorization via adaptive sparse graph regularization
    Zhang, Guifang
    Chen, Jiaxin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 12507 - 12524
  • [46] FLEXIBLE NON-NEGATIVE MATRIX FACTORIZATION WITH ADAPTIVELY LEARNED GRAPH REGULARIZATION
    Peng, Yong
    Long, Yanfang
    Qin, Feiwei
    Kong, Wanzeng
    Nie, Feiping
    Cichocki, Andrzej
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3107 - 3111
  • [47] Non-negative enhanced discriminant matrix factorization method with sparsity regularization
    Tong, Ming
    Bu, Haili
    Zhao, Mengao
    Xi, Shengnan
    Li, Hailong
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (07): : 3117 - 3140
  • [48] Non-negative enhanced discriminant matrix factorization method with sparsity regularization
    Ming Tong
    Haili Bu
    Mengao Zhao
    Shengnan Xi
    Hailong Li
    Neural Computing and Applications, 2019, 31 : 3117 - 3140
  • [49] Non-negative matrix factorization via adaptive sparse graph regularization
    Guifang Zhang
    Jiaxin Chen
    Multimedia Tools and Applications, 2021, 80 : 12507 - 12524
  • [50] Mixtures of Gamma Priors for Non-negative Matrix Factorization Based Speech Separation
    Virtanen, Tuomas
    Cemgil, Ali Taylan
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 646 - +