Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks

被引:0
|
作者
Grais, Emad M. [1 ]
Erdogan, Hakan [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, TR-34956 Istanbul, Turkey
关键词
Single channel source separation; source separation; semi-blind source separation; speech music separation; speech processing; nonnegative matrix factorization; Wiener filter;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A single channel speech-music separation algorithm based on nonnegative matrix factorization (NMF) with sliding windows and spectral masks is proposed in this work. We train a set of basis vectors for each source signal using NMF in the magnitude spectral domain. Rather than forming the columns of the matrices to be decomposed by NMF of a single spectral frame, we build them with multiple spectral frames stacked in one column. After observing the mixed signal, NMF is used to decompose its magnitude spectra into a weighted linear combination of the trained basis vector for both sources. An initial spectrogram estimate for each source is found, and a spectral mask is built using these initial estimates. This mask is used to weight the mixed signal spectrogram to find the contributions of each source signal in the mixed signal. The method is shown to perform better than the conventional NMF approach.
引用
收藏
页码:1784 / 1787
页数:4
相关论文
共 50 条
  • [11] SINGLE CHANNEL SOURCE SEPARATION USING SMOOTH NONNEGATIVE MATRIX FACTORIZATION WITH MARKOV RANDOM FIELDS
    Kim, Minje
    Smaragdis, Paris
    2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2013,
  • [12] Single-Channel Signal Separation Using Spectral Basis Correlation with Sparse Nonnegative Tensor Factorization
    P. Parathai
    N. Tengtrairat
    W. L. Woo
    Bin Gao
    Circuits, Systems, and Signal Processing, 2019, 38 : 5786 - 5816
  • [13] Single-Channel Signal Separation Using Spectral Basis Correlation with Sparse Nonnegative Tensor Factorization
    Parathai, P.
    Tengtrairat, N.
    Woo, W. L.
    Gao, Bin
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (12) : 5786 - 5816
  • [14] Regularized nonnegative matrix factorization using Gaussian mixture priors for supervised single channel source separation
    Grais, Emad M.
    Erdogan, Hakan
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03): : 746 - 762
  • [15] On-line Nonnegative Matrix Factorization for Music Signal Separation
    Lee, Seokjin
    2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 2014, : 566 - 569
  • [16] Performance Evaluation of Single Channel Speech Separation Using Non-Negative Matrix Factorization
    Nandakumar, Mona M.
    Bijoy, Edet K.
    2014 NATIONAL CONFERENCE ON COMMUNICATION, SIGNAL PROCESSING AND NETWORKING (NCCSN), 2014,
  • [17] Single-Channel Speech Separation using Sparse Non-Negative Matrix Factorization
    Schmidt, Mikkel N.
    Olsson, Rasmus K.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2614 - 2617
  • [18] SINUSOIDAL MASKS FOR SINGLE CHANNEL SPEECH SEPARATION
    Mowlaee, Pejman
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4262 - 4265
  • [19] Speech denoising using nonnegative matrix factorization with priors
    Wilson, Kevin W.
    Raj, Bhiksha
    Smaragdis, Paris
    Divakaran, Ajay
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4029 - +
  • [20] SPEECH ENHANCEMENT USING SEGMENTAL NONNEGATIVE MATRIX FACTORIZATION
    Fan, Hao-Teng
    Hung, Jeih-weih
    Lu, Xugang
    Wang, Syu-Siang
    Tsao, Yu
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,