BASIS COMPENSATION IN NON-NEGATIVE MATRIX FACTORIZATION MODEL FOR SPEECH ENHANCEMENT

被引:0
|
作者
Chung, Hanwook [1 ]
Plourde, Eric [2 ]
Champagne, Benoit [1 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] Univ Sherbrooke, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada
关键词
Single-channel speech enhancement; non-negative matrix factorization; supervised algorithm; basis adaptation; NOISE; SEPARATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a basis compensation algorithm for non-negative matrix factorization (NMF) models as applied to supervised single-channel speech enhancement. In the proposed framework, we use extra free basis vectors for both the clean speech and noise during the enhancement stage in order to capture the features which are not included in the training data. Specifically, the free basis vectors of the clean speech are obtained by exploiting a priori knowledge based on a Gamma distribution. The free bases of the noise are estimated using a regularization approach, which enforces them to be orthogonal to the clean speech and noise basis vectors estimated during the training stage. Experimental results show that the proposed NMF algorithm with basis compensation provides better performance in speech enhancement than the benchmark algorithms.
引用
收藏
页码:2249 / 2253
页数:5
相关论文
共 50 条
  • [1] Speech Enhancement Using Sparse Convolutive Non-negative Matrix Factorization with Basis Adaptation
    Carlin, Michael A.
    Malyska, Nicolas
    Quatieri, Thomas F.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 582 - 585
  • [2] Regularized non-negative matrix factorization with Gaussian mixtures and masking model for speech enhancement
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    SPEECH COMMUNICATION, 2017, 87 : 18 - 30
  • [3] Non-negative Tensor Factorization for Speech Enhancement
    He, Liang
    Zhang, Weiqiang
    Shi, Mengnan
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2016, 127
  • [4] Non-Negative Matrix Factorization Based Compensation of Music for Automatic Speech Recognition
    Raj, Bhiksha
    Virtanen, Tuomas
    Chaudhuri, Sourish
    Singh, Rita
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 717 - +
  • [5] Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization
    Keronen, Sami
    Kallasjoki, Heikki
    Palomaki, Kalle J.
    Brown, Guy J.
    Gemmeke, Jort F.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015,
  • [6] Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization
    Sami Keronen
    Heikki Kallasjoki
    Kalle J. Palomäki
    Guy J. Brown
    Jort F. Gemmeke
    EURASIP Journal on Advances in Signal Processing, 2015
  • [7] A supervised non-negative matrix factorization model for speech emotion recognition
    Hou, Mixiao
    Li, Jinxing
    Lu, Guangming
    SPEECH COMMUNICATION, 2020, 124 : 13 - 20
  • [8] Non-negative Matrix Factorization with Linear Constraints for Single-Channel Speech Enhancement
    Lyubimov, Nikolay
    Kotov, Mikhail
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 446 - 450
  • [9] A DNN-HMM Approach to Non-negative Matrix Factorization Based Speech Enhancement
    Wang, Ziteng
    Li, Xu
    Wang, Xiaofei
    Fu, Qiang
    Yan, Yonghong
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3763 - 3767
  • [10] Non-negative Matrix Factorization Speech Enhancement Method Based on Constraints of Temporal Continuity
    Zou, Qiang
    Sun, Chengli
    Yuan, Conglin
    Sun, Yifan
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 542 - 546