Stereophonic Music Separation Based on Non-negative Tensor Factorization with Cepstrum Regularization

被引：0

作者：

Seki, Shogo ^{[1
]}

Toda, Tomoki ^{[2
]}

Takeda, Kazuya ^{[1
]}

机构：

[1] Nagoya Univ, Grad Sch Informat Sci, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan

[2] Nagoya Univ, Ctr Informat Technol, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan

来源：

2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2017年

关键词：

AUDIO SOURCE SEPARATION; MATRIX FACTORIZATION; MIXTURES;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper presents a novel approach to stereophonic music separation based on Non-negative Tensor Factorization (NTF). Stereophonic music is roughly divided into two types; recorded music or synthesized music, which we focus on synthesized one in this paper. Synthesized music signals are often generated as linear combinations of many individual source signals with their mixing gains (i.e., time-invariant amplitude scaling) to each channel signal. Therefore, the synthesized stereophonic music separation is the underdetermined source separation problem where phase components are not helpful for the separation. NTF is one of the effective techniques to handle this problem, decomposing amplitude spectrograms of the stereo channel music signal into basis vectors and activations of individual music source signals and their corresponding mixing gains. However, it is essentially difficult to obtain sufficient separation performance in this separation problem as available acoustic cues for separation are limited. To address this issue, we propose a cepstrum regularization method for NTF-based stereo channel separation. The proposed method makes the separated music source signals follow the corresponding Gaussian mixture models of individual music source signals, which are trained in advance using their available samples. An experimental evaluation using real music signals is conducted to investigate the effectiveness of the proposed method in both supervised and unsupervised separation frameworks. The experimental results demonstrate that the proposed method yields significant improvements in separation performance in both frameworks.

引用

页码：981 / 985

页数：5

共 50 条

[41] Non-Negative Matrix Factorization Based Compensation of Music for Automatic Speech Recognition
Raj, Bhiksha
Virtanen, Tuomas
Chaudhuri, Sourish
Singh, Rita
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 717 - +
[42] Singing Voice Separation for Mono-Channel Music Using Non-negative Matrix Factorization
Chanrungutai, Angkana
Ratanamahatana, Chotirat Ann
2008 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS, PROCEEDINGS, 2008, : 247 - 250
[43] AN ADAPTIVE TIME-FREQUENCY RESOLUTION FRAMEWORK FOR SINGLE CHANNEL SOURCE SEPARATION BASED ON NON-NEGATIVE TENSOR FACTORIZATION
Kirbiz, S.
Gunsel, B.
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 905 - 909
[44] Advancing Non-Negative Latent Factorization of Tensors With Diversified Regularization Schemes
Wu, Hao
Luo, Xin
Zhou, Mengchu
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (03) : 1334 - 1344
[45] Non-negative matrix factorization via adaptive sparse graph regularization
Zhang, Guifang
Chen, Jiaxin
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 12507 - 12524
[46] FLEXIBLE NON-NEGATIVE MATRIX FACTORIZATION WITH ADAPTIVELY LEARNED GRAPH REGULARIZATION
Peng, Yong
Long, Yanfang
Qin, Feiwei
Kong, Wanzeng
Nie, Feiping
Cichocki, Andrzej
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3107 - 3111
[47] Non-negative enhanced discriminant matrix factorization method with sparsity regularization
Tong, Ming
Bu, Haili
Zhao, Mengao
Xi, Shengnan
Li, Hailong
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (07): : 3117 - 3140
[48] Non-negative enhanced discriminant matrix factorization method with sparsity regularization
Ming Tong
Haili Bu
Mengao Zhao
Shengnan Xi
Hailong Li
Neural Computing and Applications, 2019, 31 : 3117 - 3140
[49] Non-negative matrix factorization via adaptive sparse graph regularization
Guifang Zhang
Jiaxin Chen
Multimedia Tools and Applications, 2021, 80 : 12507 - 12524
[50] Mixtures of Gamma Priors for Non-negative Matrix Factorization Based Speech Separation
Virtanen, Tuomas
Cemgil, Ali Taylan
INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 646 - +

← 1 2 3 4 5 →