Speech Enhancement Using Convolutive Nonnegative Matrix Factorization with Cosparsity Regularization

被引:0
|
作者
Mirbagheri, Majid [1 ]
Xu, Yanbo [1 ]
Akram, Sahar [1 ]
Shamma, Shihab [1 ,2 ]
机构
[1] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
关键词
speech enhancement; cosparsity; convolutive nonnegative matrix factorization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A novel method for speech enhancement based on Convolutive Non -negative Matrix Factorization (CNMF) is presented in this paper. The sparsity of activation matrix for speech components has already been utilized in NMF-based enhancement methods. However such methods do not usually take into account prior knowledge about occurrence relations between different speech components. By introducing the notion of cosparsity, we demonstrate how such relations can be characterized from available speech data and enforced when recovering speech from noisy mixtures. Through objective evaluations we show our proposed regularization improves sparse reconstruction of speech, especially in low SNR conditions.
引用
收藏
页码:456 / 459
页数:4
相关论文
共 50 条
  • [1] Transductive Convolutive Nonnegative Matrix Factorization for Speech Separation
    Mai, Yaodan
    Lan, Long
    Guan, Naiyang
    Zhang, Xiang
    Luo, Zhigang
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1400 - 1404
  • [2] SPEECH ENHANCEMENT USING SEGMENTAL NONNEGATIVE MATRIX FACTORIZATION
    Fan, Hao-Teng
    Hung, Jeih-weih
    Lu, Xugang
    Wang, Syu-Siang
    Tsao, Yu
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization
    Mohammadiha, Nasser
    Smaragdis, Paris
    Leijon, Arne
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 2140 - 2151
  • [4] SPEECH ENHANCEMENT USING NONNEGATIVE MATRIX FACTORIZATION WITH TEMPORAL CONTINUITY
    Nam, Seung-Hyon
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2015, 34 (03): : 240 - 246
  • [5] A Bayesian approach to convolutive nonnegative matrix factorization for blind speech dereverberation
    Ibarrola, Francisco J.
    Di Persia, Leandro E.
    Spies, Ruben D.
    [J]. SIGNAL PROCESSING, 2018, 151 : 89 - 98
  • [6] Speech Enhancement Using Convolutive Non negative Matrix Factorization with Multiple Dictionaries
    He, Shan
    Wu, Jiawen
    Li, Lin
    [J]. 2015 IEEE 9TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2015, : 105 - 108
  • [7] On the use of convolutive nonnegative matrix factorization with mixed penalization for blind speech dereverberation
    Ibarrola, Francisco
    Di Persia, Leandro
    Spies, Ruben
    [J]. 2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
  • [8] Wavelet Speech Enhancement Based on Nonnegative Matrix Factorization
    Wang, Syu-Siang
    Chern, Alan
    Tsao, Yu
    Hung, Jeih-weih
    Lu, Xugang
    Lai, Ying-Hui
    Su, Borching
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (08) : 1101 - 1105
  • [9] THE NONNEGATIVE MATRIX FACTORIZATION: REGULARIZATION AND COMPLEXITY
    Ito, K.
    Landi, A. K.
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2016, 38 (02): : B327 - B346
  • [10] Speech Enhancement Based on Codebook Constrained Nonnegative Matrix Factorization
    Bai, Zhigang
    Bao, Changchun
    Yan, Bofang
    [J]. 2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 361 - 365