Single Channel Music and Speech Separation Using Non-negative Matrix Factorization

Cited by: 0
Authors
Yidirim, Sinan [1]
Saraclar, Murat [1]
Affiliations
[1] Bogazici Univ, Elekt Elekt Muhendisligi Bolumu, TR-34342 Istanbul, Turkey
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, non-negative matrix factorization (NMF) is used to separate speech and music signals from a single-channel recording. Based on the assumption that when two independent zero-mean signals are added their energies also add, we develop a two-stage method (training and separation) that operates in the time-frequency domain. Separation performance is evaluated by inspecting the power of the separated signals in the time-frequency domain and by measuring the increase in signal-to-interference and signal-to-noise ratios after separation. Finally, we discuss the problems encountered and the future work that could improve the separation performance of the method.
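The two-stage idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the cost function, the spectrogram settings, or how the separated signals are reconstructed, so everything below is an assumption made for illustration: Euclidean-cost multiplicative updates, a soft mask for reconstruction, and synthetic non-negative matrices standing in for power spectrograms. The names `nmf` and `separate` are hypothetical helpers, not the authors' code. The energy-additivity assumption appears as the mixture spectrogram being the sum of the two source spectrograms.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero

def nmf(V, rank, n_iter=200, W=None, seed=0):
    """Approximate V by W @ H using multiplicative updates (Euclidean cost).
    If W is supplied it is held fixed and only the activations H are learned."""
    rng = np.random.default_rng(seed)
    fixed = W is not None
    if not fixed:
        W = rng.random((V.shape[0], rank)) + EPS
    H = rng.random((W.shape[1], V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
        if not fixed:
            W *= (V @ H.T) / (W @ H @ H.T + EPS)
    return W, H

def separate(V_mix, W_speech, W_music, n_iter=200):
    """Separation stage: fit activations on the concatenated, fixed bases,
    then split the mixture with a soft mask (an assumed reconstruction rule)."""
    W = np.hstack([W_speech, W_music])
    _, H = nmf(V_mix, W.shape[1], n_iter=n_iter, W=W)
    k = W_speech.shape[1]
    S = W_speech @ H[:k]   # speech part of the model
    M = W_music @ H[k:]    # music part of the model
    mask = S / (S + M + EPS)
    return mask * V_mix, (1.0 - mask) * V_mix

# Toy data: random non-negative low-rank matrices in place of real spectrograms.
rng = np.random.default_rng(1)
V_speech = rng.random((64, 5)) @ rng.random((5, 100))
V_music = rng.random((64, 5)) @ rng.random((5, 100))

# Training stage: learn one set of bases per source.
W_speech, _ = nmf(V_speech, rank=5)
W_music, _ = nmf(V_music, rank=5)

# Energy additivity: the mixture power is modeled as the sum of source powers.
V_mix = V_speech + V_music
speech_est, music_est = separate(V_mix, W_speech, W_music)
```

With real audio, the matrices would come from a short-time Fourier transform of the training and mixture recordings, and the masked spectrograms would be inverted with the mixture phase; both steps are outside what the abstract specifies.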
Pages: 543-546
Number of pages: 4
Related papers
50 in total
  • [1] Performance Evaluation of Single Channel Speech Separation Using Non-Negative Matrix Factorization
    Nandakumar, Mona M.
    Bijoy, Edet K.
    [J]. 2014 NATIONAL CONFERENCE ON COMMUNICATION, SIGNAL PROCESSING AND NETWORKING (NCCSN), 2014,
  • [2] Single-Channel Speech Separation using Sparse Non-Negative Matrix Factorization
    Schmidt, Mikkel N.
    Olsson, Rasmus K.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2614 - 2617
  • [3] DISCRIMINATIVE NON-NEGATIVE MATRIX FACTORIZATION FOR SINGLE-CHANNEL SPEECH SEPARATION
    Wang, Zi
    Sha, Fei
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] Speech/Music Separation Using Non-negative Matrix Factorization with Combination of Cost Functions
    Nasersharif, Babak
    Abdali, Sara
    [J]. 2015 INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2015, : 107 - 111
  • [5] Adaptation of speaker-specific bases in non-negative matrix factorization for single channel speech-music separation
    Grais, Emad M.
    Erdogan, Hakan
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 576 - 579
  • [6] Singing Voice Separation for Mono-Channel Music Using Non-negative Matrix Factorization
    Chanrungutai, Angkana
    Ratanamahatana, Chotirat Ann
    [J]. 2008 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS, PROCEEDINGS, 2008, : 247 - 250
  • [7] Perceptually Weighted Non-negative Matrix Factorization for Blind Single-Channel Music Source Separation
    Kirbiz, S.
    Gunsel, B.
    [J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 226 - 229
  • [8] Perceptually enhanced blind single-channel music source separation by Non-negative Matrix Factorization
    Kirbiz, S.
    Gunsel, B.
    [J]. DIGITAL SIGNAL PROCESSING, 2013, 23 (02) : 646 - 658
  • [9] Robust Non-negative Matrix Factorization with β-Divergence for Speech Separation
    Li, Yinan
    Zhang, Xiongwei
    Sun, Meng
    [J]. ETRI JOURNAL, 2017, 39 (01) : 21 - 29
  • [10] Single-Channel Speech Separation Based on Non-negative Matrix Factorization and Factorial Conditional Random Field
    LI Xu
    TU Ming
    WANG Xiaofei
    WU Chao
    FU Qiang
    YAN Yonghong
    [J]. Chinese Journal of Electronics, 2018, 27 (05) : 1063 - 1070