Local Sparsity Based Online Dictionary Learning for Environment-Adaptive Speech Enhancement with Nonnegative Matrix Factorization

被引:7
|
作者
Jeon, Kwang Myung [1 ]
Kim, Hong Kook [1 ]
机构
[1] GIST, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea
基金
新加坡国家研究基金会;
关键词
speech enhancement; diverse noise; environment adaptation; nonnegative matrix factorization; online dictionary learning; local sparsity; NOISE;
D O I
10.21437/Interspeech.2016-586
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a nonnegative matrix factorization (NMF)-based speech enhancement method robust to real and diverse noise is proposed by online NMF dictionary learning without relying on prior knowledge of noise. Conventional NMF-based methods have used a fixed noise dictionary, which often results in performance degradation when the NMF noise dictionary cannot cover noise types that occur in real-life recording. Thus, the noise dictionary needs to be learned from noises according to the variation of recording environments. To this end, the proposed method first estimates noise spectra and then performs online noise dictionary learning by a discriminative NMF learning framework. In particular, the noise spectra are estimated from minimum mean squared error filtering, which is based on the local sparsity defined by a posteriori signal-to-noise ratio (SNR) estimated from the NMF separation of the previous analysis frame. The effectiveness of the proposed speech enhancement method is demonstrated by adding six different realistic noises to clean speech signals with various SNRs Consequently, it is shown that the proposed method outperforms comparative methods in terms of signal-to-distortion ratio (SDR) and perceptual evaluation of speech quality (PESQ) for all kinds of simulated noise and SNR conditions.
引用
收藏
页码:2861 / 2865
页数:5
相关论文
共 50 条
  • [31] Unsupervised Robust Speech Enhancement Based on Alpha-Stable Fast Multichannel Nonnegative Matrix Factorization
    Fontaine, Mathieu
    Sekiguchi, Kouhei
    Nugraha, Aditya Arie
    Yoshii, Kazuyoshi
    INTERSPEECH 2020, 2020, : 4541 - 4545
  • [32] On microphone arrangement for multichannel speech enhancement based on nonnegative matrix factorization in time-channel domain
    Murase, Yoshikazu
    Chiba, Hironobu
    Ono, Nobutaka
    Miyabe, Shigeki
    Yamada, Takeshi
    Makino, Shoji
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [33] Environment-adaptive speech enhancement for bilateral cochlear implants using a single processor
    Mirzahasanloo, Taher S.
    Kehtarnavaz, Nasser
    Gopalakrishna, Vanishree
    Loizou, Philipos C.
    SPEECH COMMUNICATION, 2013, 55 (04) : 523 - 534
  • [34] Efficient Model Selection for Speech Enhancement Using a Deflation Method for Nonnegative Matrix Factorization
    Kim, Minje
    Smaragdis, Paris
    2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 537 - 541
  • [35] Supervised and Semi-supervised Speech Enhancement Using Weighted Nonnegative Matrix Factorization
    Zou, Xia
    Hu, Yonggang
    Zhang, Xiongwei
    2017 9TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2017,
  • [36] Noise Suppression based on nonnegative matrix factorization for robust speech recognition
    Fan, Hao-teng
    Lin, Pao-han
    Hung, Jeih-weih
    2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3, 2014, : 1731 - +
  • [37] Speech Enhancement Based on Dictionary Learning and Low-Rank Matrix Decomposition
    Ji, Yunyun
    Zhu, Wei-Ping
    Champagne, Benoit
    IEEE ACCESS, 2019, 7 : 4936 - 4947
  • [38] Online Algorithm for Foreground Detection Based on Incremental Nonnegative Matrix Factorization
    Chen, Rong'an
    Li, Hui
    PROCEEDINGS OF 2016 THE 2ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS, 2016, : 312 - 317
  • [39] Nonnegative Matrix Factorization Based Transfer Subspace Learning for Cross-Corpus Speech Emotion Recognition
    Luo, Hui
    Han, Jiqing
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2047 - 2060
  • [40] Online matrix factorization for markovian data and applications to network dictionary learning
    Lyu, Hanbaek
    Needell, Deanna
    Balzano, Laura
    Journal of Machine Learning Research, 2020, 21