Non-negative Matrix Factorization Speech Enhancement Method Based on Constraints of Temporal Continuity

被引:0
|
作者
Zou, Qiang [1 ]
Sun, Chengli [1 ]
Yuan, Conglin [1 ]
Sun, Yifan [2 ]
机构
[1] Nanchang Hangkong Univ, Sch Informat, Nanchang, Jiangxi, Peoples R China
[2] Univ Manchester, Sch Elect & Elect Engn, Manchester, Lancs, England
基金
中国国家自然科学基金;
关键词
Speech enhancement; Temporal continuity; Non-negative matrix factorization; Sparse matrix reconstruction; SUBSPACE APPROACH; DECOMPOSITION; NOISE; RANK;
D O I
10.1109/itnec.2019.8729254
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The constrained low-rank and sparse matrix decomposition (CLSMD) method ignores the temporal continuity between adjacent speech frames in the process of speech enhancement, resulting in a sparse matrix generated by decomposition with isolated discrete points. Therefore, in order to improve the noise suppression ability of the speech system and improve the enhanced speech quality and intelligibility, this paper proposes a speech enhancement method based on Temporal continuity Constraint for Non-negative Low-rank and Sparse Matrix Decomposition (TCNLSMD). In this method, in addition to adding low -rank and sparse constraints, temporal continuity constraints are added. The proposed method based on the sparse matrix obtained by eigenvalue decomposition of non-negative matrices and hard-threshold function estimation, the discrete sparse matrix is reduced by adding temporal continuity constraints to reduce discrete isolated points, retaining more speech information and reducing the enhanced speech distortion. The experimental results show that under various types of noise test conditions, compared with the current mainstream speech enhancement methods, especially with NLSMD, the proposed method improve the noise suppression capability, make the residual noise less, and improve the quality of the enhanced speech.
引用
收藏
页码:542 / 546
页数:5
相关论文
共 50 条
  • [1] Non-negative Matrix Factorization with Linear Constraints for Single-Channel Speech Enhancement
    Lyubimov, Nikolay
    Kotov, Mikhail
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 446 - 450
  • [2] Non-Negative Matrix Factorization with Constraints
    Liu, Haifeng
    Wu, Zhaohui
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 506 - 511
  • [3] BASIS COMPENSATION IN NON-NEGATIVE MATRIX FACTORIZATION MODEL FOR SPEECH ENHANCEMENT
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2249 - 2253
  • [4] Regularized Non-negative Matrix Factorization with Temporal Dependencies for Speech Denoising
    Wilson, Kevin W.
    Raj, Bhiksha
    Smaragdis, Paris
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 411 - +
  • [5] A DNN-HMM Approach to Non-negative Matrix Factorization Based Speech Enhancement
    Wang, Ziteng
    Li, Xu
    Wang, Xiaofei
    Fu, Qiang
    Yan, Yonghong
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3763 - 3767
  • [6] Non-negative Tensor Factorization for Speech Enhancement
    He, Liang
    Zhang, Weiqiang
    Shi, Mengnan
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2016, 127
  • [7] Non-negative matrix factorization with sparseness constraints
    Hoyer, PO
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 1457 - 1469
  • [8] Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization
    Keronen, Sami
    Kallasjoki, Heikki
    Palomaki, Kalle J.
    Brown, Guy J.
    Gemmeke, Jort F.
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015,
  • [9] Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization
    Sami Keronen
    Heikki Kallasjoki
    Kalle J. Palomäki
    Guy J. Brown
    Jort F. Gemmeke
    [J]. EURASIP Journal on Advances in Signal Processing, 2015
  • [10] STATISTICAL SPEECH ENHANCEMENT BASED ON PROBABILISTIC INTEGRATION OF VARIATIONAL AUTOENCODER AND NON-NEGATIVE MATRIX FACTORIZATION
    Bando, Yoshiaki
    Mimura, Masato
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 716 - 720