Local Sparsity Based Online Dictionary Learning for Environment-Adaptive Speech Enhancement with Nonnegative Matrix Factorization

Cited by: 7
Authors
Jeon, Kwang Myung [1]
Kim, Hong Kook [1]
Affiliations
[1] GIST, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea
Funding
National Research Foundation, Singapore;
Keywords
speech enhancement; diverse noise; environment adaptation; nonnegative matrix factorization; online dictionary learning; local sparsity; NOISE;
DOI
10.21437/Interspeech.2016-586
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
In this paper, a nonnegative matrix factorization (NMF)-based speech enhancement method robust to real and diverse noise is proposed by online NMF dictionary learning without relying on prior knowledge of noise. Conventional NMF-based methods have used a fixed noise dictionary, which often results in performance degradation when the NMF noise dictionary cannot cover noise types that occur in real-life recording. Thus, the noise dictionary needs to be learned from noises according to the variation of recording environments. To this end, the proposed method first estimates noise spectra and then performs online noise dictionary learning by a discriminative NMF learning framework. In particular, the noise spectra are estimated from minimum mean squared error filtering, which is based on the local sparsity defined by a posteriori signal-to-noise ratio (SNR) estimated from the NMF separation of the previous analysis frame. The effectiveness of the proposed speech enhancement method is demonstrated by adding six different realistic noises to clean speech signals with various SNRs. Consequently, it is shown that the proposed method outperforms comparative methods in terms of signal-to-distortion ratio (SDR) and perceptual evaluation of speech quality (PESQ) for all kinds of simulated noise and SNR conditions.
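As a rough illustration of the general idea in the abstract (separating speech from noise with NMF dictionaries and refreshing the noise dictionary from estimated noise spectra), the following is a minimal sketch, not the authors' algorithm. It assumes plain KL-divergence multiplicative updates rather than the discriminative NMF framework, and it takes the noise estimate as given instead of deriving it from local-sparsity-based MMSE filtering. All function names (`kl_activations`, `enhance`, `update_noise_dictionary`) and parameters are hypothetical.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero


def kl_activations(V, W, n_iter=100, seed=0):
    """Estimate nonnegative activations H with W held fixed, so V ~= W @ H.

    Uses the standard multiplicative update for the KL divergence.
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + EPS
    col_sums = W.sum(axis=0)[:, None] + EPS
    for _ in range(n_iter):
        H *= (W.T @ (V / (W @ H + EPS))) / col_sums
    return H


def enhance(V, W_speech, W_noise, n_iter=100):
    """Split a magnitude spectrogram V into speech and noise estimates.

    The split uses a Wiener-like soft mask built from the two NMF models.
    """
    W = np.hstack([W_speech, W_noise])
    H = kl_activations(V, W, n_iter)
    k = W_speech.shape[1]
    S = W_speech @ H[:k]          # modelled speech magnitude
    N = W_noise @ H[k:]           # modelled noise magnitude
    mask = S / (S + N + EPS)
    return mask * V, (1.0 - mask) * V


def update_noise_dictionary(W_noise, noise_est, n_iter=20, seed=0):
    """Refine the noise bases on estimated noise spectra.

    Assumption: plain KL-NMF updates stand in for the paper's
    discriminative learning step, which is not reproduced here.
    """
    H = kl_activations(noise_est, W_noise, n_iter, seed)
    W = W_noise.copy()
    for _ in range(n_iter):
        W *= ((noise_est / (W @ H + EPS)) @ H.T) / (H.sum(axis=1)[None, :] + EPS)
        H *= (W.T @ (noise_est / (W @ H + EPS))) / (W.sum(axis=0)[:, None] + EPS)
    return W
```

In an online setting, one might call `enhance` on each block of frames and then pass the resulting noise estimate to `update_noise_dictionary` before processing the next block; how the noise estimate is actually obtained in the paper is described only at the level of the abstract above.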
Pages: 2861 - 2865
Page count: 5
Related papers
50 in total
  • [21] Speech enhancement based on nonnegative matrix factorization in constant-Q frequency domain
    Xu, Longting
    Wei, Zhilin
    Zaidi, Syed Faham Ali
    Ren, Bo
    Yang, Jichen
    APPLIED ACOUSTICS, 2021, 174
  • [22] Speech Enhancement Using Convolutive Nonnegative Matrix Factorization with Cosparsity Regularization
    Mirbagheri, Majid
    Xu, Yanbo
    Akram, Sahar
    Shamma, Shihab
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 456 - 459
  • [23] LINEAR DEMIXED DOMAIN MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR SPEECH ENHANCEMENT
    Taniguchi, Toru
    Masuda, Taro
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 476 - 480
  • [24] A New Approach to Dictionary-Based Nonnegative Matrix Factorization
    Cohen, Jeremy E.
    Gillis, Nicolas
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 493 - 497
  • [25] ONLINE NONNEGATIVE MATRIX FACTORIZATION BASED ON KERNEL MACHINES
    Zhu, Fei
    Honeine, Paul
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2381 - 2385
  • [26] A NEW LINEAR MMSE FILTER FOR SINGLE CHANNEL SPEECH ENHANCEMENT BASED ON NONNEGATIVE MATRIX FACTORIZATION
    Mohammadiha, Nasser
    Gerkmann, Timo
    Leijon, Arne
    2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 45 - 48
  • [27] ADAPTIVE ENDMEMBER EXTRACTION BASED SPARSE NONNEGATIVE MATRIX FACTORIZATION WITH SPATIAL LOCAL INFORMATION
    Li, Huali
    Li, Shutao
    Zhang, Liangpei
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 1753 - 1756
  • [28] Leveraging Nonnegative Matrix Factorization in Processing the Temporal Modulation Spectrum for Speech Enhancement
    Wang, Syu-Siang
    Yang, Jeremy Chiaming
    Tsao, Yu
    Hung, Jeih-weih
    2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2016, : 309 - 310
  • [29] PARALLEL OPTIMIZATION OF HYPERSPECTRAL UNMIXING BASED ON SPARSITY CONSTRAINED NONNEGATIVE MATRIX FACTORIZATION
    Wu, Zebin
    Ye, Shun
    Wei, Jie
    Liu, Jianjun
    Wei, Zhihui
    Sun, Le
    2013 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2013, : 1438 - 1441
  • [30] Environment-Adaptive Online Learning for Portable Energy Storage Based on Porous Electrode Model
    He, Guannan
    Ding, Yongkang
    Wu, Zhengrun
    Chen, Xinjiang
    Zhang, Da
    Song, Jie
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 8386 - 8399