Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks

被引:0
|
作者
Grais, Emad M. [1 ]
Erdogan, Hakan [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, TR-34956 Istanbul, Turkey
关键词
Single channel source separation; source separation; semi-blind source separation; speech music separation; speech processing; nonnegative matrix factorization; Wiener filter;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A single channel speech-music separation algorithm based on nonnegative matrix factorization (NMF) with sliding windows and spectral masks is proposed in this work. We train a set of basis vectors for each source signal using NMF in the magnitude spectral domain. Rather than forming the columns of the matrices to be decomposed by NMF of a single spectral frame, we build them with multiple spectral frames stacked in one column. After observing the mixed signal, NMF is used to decompose its magnitude spectra into a weighted linear combination of the trained basis vector for both sources. An initial spectrogram estimate for each source is found, and a spectral mask is built using these initial estimates. This mask is used to weight the mixed signal spectrogram to find the contributions of each source signal in the mixed signal. The method is shown to perform better than the conventional NMF approach.
引用
收藏
页码:1784 / 1787
页数:4
相关论文
共 50 条
  • [1] Nonnegative Matrix Factorization with Disjointness Constraints for Single Channel Speech Separation
    Huang, Jianjun
    Zhang, Xiongwei
    Zhang, Yafei
    Wu, Haijia
    [J]. PROCEEDINGS OF 2012 IEEE 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, 2012, : 1149 - 1153
  • [2] Single Channel Music and Speech Separation Using Non-negative Matrix Factorization
    Yidirim, Sinan
    Saraclar, Murat
    [J]. 2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 543 - 546
  • [3] Toward Finding Optimal Source Dictionaries for Single Channel Music Source Separation Using Nonnegative Matrix Factorization
    Rathnayake, Bhathiya
    Weerakoon, K. M. K.
    Godaliyadda, G. M. R., I
    Ekanayake, M. P. B.
    [J]. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 1493 - 1500
  • [4] Supervised Separation of Speech from Background Piano Music using a Nonnegative Matrix Factorization Approach
    Martinez-Colon, A.
    Canadas-Quesada, F. J.
    Vera-Candeas, P.
    Ruiz-Reyes, N.
    Moreno-Fuentes, F.
    [J]. STAIRS 2014, 2014, 264 : 181 - 190
  • [5] Layered Nonnegative Matrix Factorization for Speech Separation
    Hsu, Chung-Chien
    Chien, Jen-Tzung
    Chi, Tai-Shih
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 628 - 632
  • [6] Initialization of Nonnegative Matrix Factorization Dictionaries for Single Channel Source Separation
    Grais, Emad M.
    Erdogan, Hakan
    [J]. 2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [7] Discriminative Layered Nonnegative Matrix Factorization for Speech Separation
    Hsu, Chung-Chien
    Chi, Tai-Shih
    Chien, Jen-Tzung
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 560 - 564
  • [8] Transductive Convolutive Nonnegative Matrix Factorization for Speech Separation
    Mai, Yaodan
    Lan, Long
    Guan, Naiyang
    Zhang, Xiang
    Luo, Zhigang
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1400 - 1404
  • [9] Deep Transductive Nonnegative Matrix Factorization for Speech Separation
    Liu, Yalin
    Guan, Naiyang
    Liu, Jie
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 249 - 254
  • [10] Single-channel Music/Speech Separation Using Non-linear Masks
    Mowlaee, P.
    Sayadian, A.
    Sheikhan, M.
    Fallah, M.
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 782 - +