Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization

被引:328
|
作者
Mohammadiha, Nasser [1 ]
Smaragdis, Paris [2 ,3 ]
Leijon, Arne [1 ]
机构
[1] KTH Royal Inst Technol, Dept Elect Engn, SE-10044 Stockholm, Sweden
[2] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[3] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
关键词
Bayesian inference; HMM; nonnegative matrix factorization (NMF); PLCA; speech enhancement; SQUARE ERROR ESTIMATION; NOISE; SEPARATION; SIGNALS;
D O I
10.1109/TASL.2013.2270369
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Reducing the interference noise in a monaural noisy speech signal has been a challenging task for many years. Compared to traditional unsupervised speech enhancement methods, e. g., Wiener filtering, supervised approaches, such as algorithms based on hidden Markov models (HMM), lead to higher-quality enhanced speech signals. However, the main practical difficulty of these approaches is that for each noise type a model is required to be trained a priori. In this paper, we investigate a new class of supervised speech denoising algorithms using nonnegative matrix factorization (NMF). We propose a novel speech enhancement method that is based on a Bayesian formulation of NMF (BNMF). To circumvent the mismatch problem between the training and testing stages, we propose two solutions. First, we use an HMM in combination with BNMF (BNMF-HMM) to derive a minimum mean square error (MMSE) estimator for the speech signal with no information about the underlying noise type. Second, we suggest a scheme to learn the required noise BNMF model online, which is then used to develop an unsupervised speech enhancement system. Extensive experiments are carried out to investigate the performance of the proposed methods under different conditions. Moreover, we compare the performance of the developed algorithms with state-of-the-art speech enhancement schemes using various objective measures. Our simulations show that the proposed BNMF-based methods outperform the competing algorithms substantially.
引用
收藏
页码:2140 / 2151
页数:12
相关论文
共 50 条
  • [21] Continuous Semi-Supervised Nonnegative Matrix Factorization
    Lindstrom, Michael R. R.
    Ding, Xiaofu
    Liu, Feng
    Somayajula, Anand
    Needell, Deanna
    ALGORITHMS, 2023, 16 (04)
  • [22] Supervised kernel nonnegative matrix factorization for face recognition
    Chen, Wen-Sheng
    Zhao, Yang
    Pan, Binbin
    Chen, Bo
    NEUROCOMPUTING, 2016, 205 : 165 - 181
  • [23] Self-Supervised Symmetric Nonnegative Matrix Factorization
    Jia, Yuheng
    Liu, Hui
    Hou, Junhui
    Kwong, Sam
    Zhang, Qingfu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4526 - 4537
  • [24] Robust Semi-supervised Nonnegative Matrix Factorization
    Wang, Jing
    Tian, Feng
    Liu, Chang Hong
    Wang, Xiao
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [25] RECOGNIZE AND SEPARATE APPROACH FOR SPEECH DENOISING USING NONNEGATIVE MATRIX FACTORIZATION
    Sohrab, Fahad
    Erdogan, Hakan
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1083 - 1087
  • [26] AMPLITUDE-BASED SPEECH ENHANCEMENT WITH NONNEGATIVE MATRIX FACTORIZATION FOR ASYNCHRONOUS DISTRIBUTED RECORDING
    Chiba, Hironobu
    Ono, Nobutaka
    Miyabe, Shigeki
    Takahashi, Yu
    Yamada, Takeshi
    Makino, Shoji
    2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 203 - 207
  • [27] TRANSDUCTIVE NONNEGATIVE MATRIX FACTORIZATION FOR SEMI-SUPERVISED HIGH-PERFORMANCE SPEECH SEPARATION
    Guan, Naiyang
    Lan, Long
    Tao, Dacheng
    Luo, Zhigang
    Yang, Xuejun
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [28] Speech enhancement based on nonnegative matrix factorization in constant-Q frequency domain
    Xu, Longting
    Wei, Zhilin
    Zaidi, Syed Faham Ali
    Ren, Bo
    Yang, Jichen
    APPLIED ACOUSTICS, 2021, 174
  • [29] Network Embedding Using Semi-Supervised Kernel Nonnegative Matrix Factorization
    He, Chaobo
    Zhang, Qiong
    Tang, Yong
    Liu, Shuangyin
    Liu, Hai
    IEEE ACCESS, 2019, 7 : 92732 - 92744
  • [30] Enhancement of decomposed spectral coherence using sparse nonnegative matrix factorization
    Lee, Jeung-Hoon
    Mechanical Systems and Signal Processing, 2021, 157