Discriminative Nonnegative Dictionary Learning using Cross-Coherence Penalties for Single Channel Source Separation

被引：0

作者：

Grais, Emad M. ^{[1
]}

Erdogan, Hakan ^{[1
]}

机构：

[1] Sabanci Univ, Fac Engn & Nat Sci, TR-34956 Istanbul, Turkey

来源：

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年

关键词：

Single channel source separation; nonnegative matrix factorization; discriminative training; dictionary learning; MATRIX FACTORIZATION; ALGORITHMS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we introduce a new discriminative training method for nonnegative dictionary learning. The new method can be used in single channel source separation (SCSS) applications. In SCSS, nonnegative matrix factorization (NMF) is used to learn a dictionary (a set of basis vectors) for each source in the magnitude spectrum domain. The trained dictionaries are then used in decomposing the mixed signal to find the estimate for each source. Learning discriminative dictionaries for the source signals can improve the separation performance. To achieve discriminative dictionaries, we try to avoid the bases set of one source dictionary from representing the other source signals. We propose to minimize cross-coherence between the dictionaries of all sources in the mixed signal. We incorporate a simplified cross-coherence penalty using a regularized NMF cost function to simultaneously learn discriminative and reconstructive dictionaries. The new regularized NMF update rules that are used to discriminatively train the dictionaries are introduced in this work. Experimental results show that using discriminative training gives better separation results than using conventional NMF.

引用

页码：808 / 812

页数：5

共 50 条

[31] Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation
Fan, Cunhang
Liu, Bin
Tao, Jianhua
Wen, Zhengqi
Yi, Jiangyan
Bai, Ye
2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 26 - 30
[32] Low probability of intercept signal separation method using discriminative amplitude-phase dictionary learning
Chen Y.
Zhou Y.
Wang X.
Tian Y.
Zhou D.
Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2019, 41 (03): : 18 - 24
[33] Speaker Independent Single Channel Source Separation Using Sinusoidal Features
Ranjan, Shivesh
Payton, Karen L.
Mowlaee, Pejman
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1522 - 1525
[34] Towards Automated Single Channel Source Separation using Neural Networks
Gang, Arpita
Biyani, Pravesh
Soni, Akshay
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3494 - 3498
[35] Single Channel Blind Source Separation using the Best Characteristic Basis
Gao, Bin
Woo, W. L.
Dlay, S. S.
2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 795 - 799
[36] SINGLE CHANNEL AUDIO SOURCE SEPARATION USING CONVOLUTIONAL DENOISING AUTOENCODERS
Grais, Emad M.
Plumbley, Mark D.
2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1265 - 1269
[37] Single-Channel Source Separation Using Complex Matrix Factorization
King, Brian J.
Atlas, Les
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2591 - 2597
[38] Fetal Phonocardiogram Extraction Using Single Channel Blind Source Separation
Samieinasab, Maryam
Sameni, Reza
2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 78 - 83
[39] Accelerated Proximal Algorithm for Finding the Dantzig Selector and Source Separation Using Dictionary Learning
Ullah, Hayat
Amir, Muhammad
Iqbal, Muhammad
Khan, Ahmad
Khan, Wasim
TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2020, 27 (04): : 1174 - 1180
[40] Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks
Grais, Emad M.
Erdogan, Hakan
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1784 - 1787

← 1 2 3 4 5 →