Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization

被引:328
|
作者
Mohammadiha, Nasser [1 ]
Smaragdis, Paris [2 ,3 ]
Leijon, Arne [1 ]
机构
[1] KTH Royal Inst Technol, Dept Elect Engn, SE-10044 Stockholm, Sweden
[2] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[3] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
关键词
Bayesian inference; HMM; nonnegative matrix factorization (NMF); PLCA; speech enhancement; SQUARE ERROR ESTIMATION; NOISE; SEPARATION; SIGNALS;
D O I
10.1109/TASL.2013.2270369
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Reducing the interference noise in a monaural noisy speech signal has been a challenging task for many years. Compared to traditional unsupervised speech enhancement methods, e. g., Wiener filtering, supervised approaches, such as algorithms based on hidden Markov models (HMM), lead to higher-quality enhanced speech signals. However, the main practical difficulty of these approaches is that for each noise type a model is required to be trained a priori. In this paper, we investigate a new class of supervised speech denoising algorithms using nonnegative matrix factorization (NMF). We propose a novel speech enhancement method that is based on a Bayesian formulation of NMF (BNMF). To circumvent the mismatch problem between the training and testing stages, we propose two solutions. First, we use an HMM in combination with BNMF (BNMF-HMM) to derive a minimum mean square error (MMSE) estimator for the speech signal with no information about the underlying noise type. Second, we suggest a scheme to learn the required noise BNMF model online, which is then used to develop an unsupervised speech enhancement system. Extensive experiments are carried out to investigate the performance of the proposed methods under different conditions. Moreover, we compare the performance of the developed algorithms with state-of-the-art speech enhancement schemes using various objective measures. Our simulations show that the proposed BNMF-based methods outperform the competing algorithms substantially.
引用
收藏
页码:2140 / 2151
页数:12
相关论文
共 50 条
  • [41] Self-supervised Adaptive Kernel Nonnegative Matrix Factorization
    Deng, Furong
    Zhao, Yang
    Pei, Jihong
    Yang, Xuan
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 161 - 173
  • [42] Supervised and Constrained Nonnegative Matrix Factorization with Sparseness for Image Representation
    Xibiao Cai
    Fuming Sun
    Wireless Personal Communications, 2018, 102 : 3055 - 3066
  • [43] Supervised Nonnegative Matrix Factorization to Predict ICU Mortality Risk
    Chao, Guoqing
    Mao, Chengsheng
    Wang, Fei
    Zhao, Yuan
    Luo, Yuan
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 1189 - 1194
  • [44] Supervised and Constrained Nonnegative Matrix Factorization with Sparseness for Image Representation
    Cai, Xibiao
    Sun, Fuming
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 102 (04) : 3055 - 3066
  • [45] Semi-supervised Nonnegative Matrix Factorization with Commonness Extraction
    Teng, Yueyang
    Qi, Shouliang
    Dai, Yin
    Xu, Lisheng
    Qian, Wei
    Kang, Yan
    NEURAL PROCESSING LETTERS, 2017, 45 (03) : 1063 - 1076
  • [46] OVERLAPPING SOUND EVENT DETECTION WITH SUPERVISED NONNEGATIVE MATRIX FACTORIZATION
    Bisot, Victor
    Essid, Slim
    Richard, Gael
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 31 - 35
  • [47] Adaptive recurrent nonnegative matrix factorization with phase compensation for Single-Channel speech enhancement
    Tank, Vanita Raj
    Mahajan, Shrinivas Padmakar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (20) : 28249 - 28294
  • [48] Adaptive recurrent nonnegative matrix factorization with phase compensation for Single-Channel speech enhancement
    Vanita Raj Tank
    Shrinivas Padmakar Mahajan
    Multimedia Tools and Applications, 2022, 81 : 28249 - 28294
  • [49] Semi-supervised Nonnegative Matrix Factorization with Commonness Extraction
    Yueyang Teng
    Shouliang Qi
    Yin Dai
    Lisheng Xu
    Wei Qian
    Yan Kang
    Neural Processing Letters, 2017, 45 : 1063 - 1076
  • [50] Solving consensus and semi-supervised clustering problems using nonnegative matrix factorization
    Li, Tao
    Ding, Chris
    Jordan, Michael I.
    ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 577 - +