Discriminative Nonnegative Dictionary Learning using Cross-Coherence Penalties for Single Channel Source Separation

被引:0
|
作者
Grais, Emad M. [1 ]
Erdogan, Hakan [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, TR-34956 Istanbul, Turkey
关键词
Single channel source separation; nonnegative matrix factorization; discriminative training; dictionary learning; MATRIX FACTORIZATION; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we introduce a new discriminative training method for nonnegative dictionary learning. The new method can be used in single channel source separation (SCSS) applications. In SCSS, nonnegative matrix factorization (NMF) is used to learn a dictionary (a set of basis vectors) for each source in the magnitude spectrum domain. The trained dictionaries are then used in decomposing the mixed signal to find the estimate for each source. Learning discriminative dictionaries for the source signals can improve the separation performance. To achieve discriminative dictionaries, we try to avoid the bases set of one source dictionary from representing the other source signals. We propose to minimize cross-coherence between the dictionaries of all sources in the mixed signal. We incorporate a simplified cross-coherence penalty using a regularized NMF cost function to simultaneously learn discriminative and reconstructive dictionaries. The new regularized NMF update rules that are used to discriminatively train the dictionaries are introduced in this work. Experimental results show that using discriminative training gives better separation results than using conventional NMF.
引用
收藏
页码:808 / 812
页数:5
相关论文
共 50 条
  • [31] Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation
    Fan, Cunhang
    Liu, Bin
    Tao, Jianhua
    Wen, Zhengqi
    Yi, Jiangyan
    Bai, Ye
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 26 - 30
  • [32] Low probability of intercept signal separation method using discriminative amplitude-phase dictionary learning
    Chen Y.
    Zhou Y.
    Wang X.
    Tian Y.
    Zhou D.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2019, 41 (03): : 18 - 24
  • [33] Speaker Independent Single Channel Source Separation Using Sinusoidal Features
    Ranjan, Shivesh
    Payton, Karen L.
    Mowlaee, Pejman
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1522 - 1525
  • [34] Towards Automated Single Channel Source Separation using Neural Networks
    Gang, Arpita
    Biyani, Pravesh
    Soni, Akshay
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3494 - 3498
  • [35] Single Channel Blind Source Separation using the Best Characteristic Basis
    Gao, Bin
    Woo, W. L.
    Dlay, S. S.
    2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 795 - 799
  • [36] SINGLE CHANNEL AUDIO SOURCE SEPARATION USING CONVOLUTIONAL DENOISING AUTOENCODERS
    Grais, Emad M.
    Plumbley, Mark D.
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1265 - 1269
  • [37] Single-Channel Source Separation Using Complex Matrix Factorization
    King, Brian J.
    Atlas, Les
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2591 - 2597
  • [38] Fetal Phonocardiogram Extraction Using Single Channel Blind Source Separation
    Samieinasab, Maryam
    Sameni, Reza
    2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 78 - 83
  • [39] Accelerated Proximal Algorithm for Finding the Dantzig Selector and Source Separation Using Dictionary Learning
    Ullah, Hayat
    Amir, Muhammad
    Iqbal, Muhammad
    Khan, Ahmad
    Khan, Wasim
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2020, 27 (04): : 1174 - 1180
  • [40] Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks
    Grais, Emad M.
    Erdogan, Hakan
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1784 - 1787