Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks

被引：0

作者：

Grais, Emad M. ^{[1
]}

Erdogan, Hakan ^{[1
]}

机构：

[1] Sabanci Univ, Fac Engn & Nat Sci, TR-34956 Istanbul, Turkey

来源：

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年

关键词：

Single channel source separation; source separation; semi-blind source separation; speech music separation; speech processing; nonnegative matrix factorization; Wiener filter;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A single channel speech-music separation algorithm based on nonnegative matrix factorization (NMF) with sliding windows and spectral masks is proposed in this work. We train a set of basis vectors for each source signal using NMF in the magnitude spectral domain. Rather than forming the columns of the matrices to be decomposed by NMF of a single spectral frame, we build them with multiple spectral frames stacked in one column. After observing the mixed signal, NMF is used to decompose its magnitude spectra into a weighted linear combination of the trained basis vector for both sources. An initial spectrogram estimate for each source is found, and a spectral mask is built using these initial estimates. This mask is used to weight the mixed signal spectrogram to find the contributions of each source signal in the mixed signal. The method is shown to perform better than the conventional NMF approach.

引用

下载

页码：1784 / 1787

页数：4

共 50 条

[21] A NEW LINEAR MMSE FILTER FOR SINGLE CHANNEL SPEECH ENHANCEMENT BASED ON NONNEGATIVE MATRIX FACTORIZATION
Mohammadiha, Nasser
Gerkmann, Timo
Leijon, Arne
2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 45 - 48
[22] Adaptive recurrent nonnegative matrix factorization with phase compensation for Single-Channel speech enhancement
Tank, Vanita Raj
Mahajan, Shrinivas Padmakar
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (20) : 28249 - 28294
[23] Adaptive recurrent nonnegative matrix factorization with phase compensation for Single-Channel speech enhancement
Vanita Raj Tank
Shrinivas Padmakar Mahajan
Multimedia Tools and Applications, 2022, 81 : 28249 - 28294
[24] Music Signal Separation by Supervised Nonnegative Matrix Factorization with Basis Deformation
Kitamura, Daichi
Saruwatari, Hiroshi
Shikano, Kiyohiro
Kondo, Kazunobu
Takahashi, Yu
2013 18TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2013,
[25] Nonnegative matrix factorization 2D with the flexible β-Divergence for Single Channel Source Separation
Yu, Kaiwen
Woo, W. L.
Dlay, S. S.
2015 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2015), 2015,
[26] SINGLE-CHANNEL SPEECH SEPARATION BY INCLUDING SPECTRAL STRUCTURE INFORMATION WITHIN NON-NEGATIVE MATRIX FACTORIZATION
Feng, Yuxiao
Ritz, Christian
2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 620 - 624
[27] Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization
Mohammadiha, Nasser
Smaragdis, Paris
Leijon, Arne
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 2140 - 2151
[28] SPEECH ENHANCEMENT USING NONNEGATIVE MATRIX FACTORIZATION WITH TEMPORAL CONTINUITY
Nam, Seung-Hyon
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2015, 34 (03): : 240 - 246
[29] Music Enhancement Using Nonnegative Matrix Factorization with Penalty Masking
Lin, ChingShun
Cheng, ZongChao
Shih, DongLiang
2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 125 - 129
[30] DISCRIMINATIVE NON-NEGATIVE MATRIX FACTORIZATION FOR SINGLE-CHANNEL SPEECH SEPARATION
Wang, Zi
Sha, Fei
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,

← 1 2 3 4 5 →