Single Channel Music and Speech Separation Using Non-negative Matrix Factorization

被引：0

作者：

Yidirim, Sinan ^{[1
]}

Saraclar, Murat ^{[1
]}

机构：

[1] Bogazici Univ, Elekt Elekt Muhendisligi Bolumu, TR-34342 Istanbul, Turkey

来源：

2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2 | 2009年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper non-negative matrix factorization (NMF) is used to separate speech and music signals based on a single channel recording. The assumption that if two independent zero-mean signals are added then their energies are also added has led us to develop a two-stage method (training and separation) that works on time-frequency domain. The performance of the method in separation is evaluated by observing the power of the separated signals in time-frequency domain, and by measuring the increase in signal-to-interference and signal-to-noise ratios after separation. Finally, we discuss the problems faced and the work that can be done in future to enhance the performance of the method in separation.

引用

页码：543 / 546

页数：4

共 50 条

[1] Performance Evaluation of Single Channel Speech Separation Using Non-Negative Matrix Factorization
Nandakumar, Mona M.
Bijoy, Edet K.
[J]. 2014 NATIONAL CONFERENCE ON COMMUNICATION, SIGNAL PROCESSING AND NETWORKING (NCCSN), 2014,
[2] Single-Channel Speech Separation using Sparse Non-Negative Matrix Factorization
Schmidt, Mikkel N.
Olsson, Rasmus K.
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2614 - 2617
[3] DISCRIMINATIVE NON-NEGATIVE MATRIX FACTORIZATION FOR SINGLE-CHANNEL SPEECH SEPARATION
Wang, Zi
Sha, Fei
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[4] Speech/Music Separation Using Non-negative Matrix Factorization with Combination of Cost Functions
Nasersharif, Babak
Abdali, Sara
[J]. 2015 INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2015, : 107 - 111
[5] Adaptation of speaker-specific bases in non-negative matrix factorization for single channel speech-music separation
Grais, Emad M.
Erdogan, Hakan
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 576 - 579
[6] Singing Voice Separation for Mono-Channel Music Using Non-negative Matrix Factorization
Chanrungutai, Angkana
Ratanamahatana, Chotirat Ann
[J]. 2008 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS, PROCEEDINGS, 2008, : 247 - 250
[7] Perceptually Weighted Non-negative Matrix Factorization for Blind Single-Channel Music Source Separation
Kirbiz, S.
Gunsel, B.
[J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 226 - 229
[8] Perceptually enhanced blind single-channel music source separation by Non-negative Matrix Factorization
Kirbiz, S.
Gunsel, B.
[J]. DIGITAL SIGNAL PROCESSING, 2013, 23 (02) : 646 - 658
[9] Robust Non-negative Matrix Factorization with β-Divergence for Speech Separation
Li, Yinan
Zhang, Xiongwei
Sun, Meng
[J]. ETRI JOURNAL, 2017, 39 (01) : 21 - 29
[10] Single-Channel Speech Separation Based on Non-negative Matrix Factorization and Factorial Conditional Random Field
LI Xu
TU Ming
WANG Xiaofei
WU Chao
FU Qiang
YAN Yonghong
[J]. Chinese Journal of Electronics, 2018, 27 (05) : 1063 - 1070

← 1 2 3 4 5 →