Prediction of NMF-based Wiener Filter for Speech Enhancement Using Deep Neural Networks

被引:0
|
作者
Bai, Zhigang [1 ]
Bao, Changchun [1 ]
Cui, Zihao [1 ]
机构
[1] Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing 100124, Peoples R China
基金
中国国家自然科学基金;
关键词
speech enhancement; nonnegative matrix factorization; deep neural networks; NMF-based Wiener filter;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a novel approach is presented to predict a training target called NMF-based Wiener filter using deep neural networks (DNN) in the nonnegative matrix factorization (NMF) based speech enhancement. The NMF-based Wiener filter, as a masking-based target, is easier than the encoding vectors used in previous algorithms for parameter estimation. The intermediate error of the NMF-based speech enhancement process was reduced due to direct prediction of the NMF-based Wiener filter. The encoding vectors of noisy speech were extracted with the NMF algorithm and normalized to obtain more discriminative input features. The DNN was trained to learn a nonlinear mapping from the encoding vector of noisy speech to the NMF-based Wiener filter. At test stage, the predicted NMF-based Wiener filter was used to enhance noisy speech. The objective evaluations demonstrated that the proposed algorithm outperforms some existing NMF-based and DNN-based methods at various input signal-to-noise ratio (SNR) levels.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Improvement of Wiener Filter based Speech Enhancement using Compressive Sensing
    Sulong, Amart
    Kadir, Kushsairy
    Gunawan, Teddy S.
    Khalifa, Othman O.
    2014 IEEE INTERNATIONAL CONFERENCE ON SMART INSTRUMENTATION, MEASUREMENT AND APPLICATIONS (ICSIMA), 2014,
  • [22] A multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users
    Dekkers, Gert
    van Waterschoot, Toon
    Vanrumste, Bart
    Van Den Broeck, Bert
    Gemmeke, Jort F.
    Van Hamme, Hugo
    Karsmakers, Peter
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 746 - 750
  • [23] Speech Enhancement by Classification of Noisy Signals Decomposed Using NMF and Wiener Filtering
    Fakhry, Mahmoud
    Poorjam, Amir Hossein
    Christensen, Mads Graesboll
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 16 - 20
  • [24] Noise Reduction and Speech Enhancement Using Wiener Filter
    Nuha, Hilal H.
    Absa, Ahmad Abo
    2022 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ITS APPLICATIONS (ICODSA), 2022, : 177 - 180
  • [25] Speech Enhancement Based on the Wiener Filter and Wavelet Entropy
    Jiao, Mingke
    Lou, Lin
    Geng, Xiliang
    Wang, Zhongming
    Zhang, Peng
    Liao, Xijiang
    Zhang, Wenyuan
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 1956 - 1960
  • [26] PERCEPTUALLY GUIDED SPEECH ENHANCEMENT USING DEEP NEURAL NETWORKS
    Zhao, Yan
    Xu, Buye
    Giri, Ritwik
    Zhang, Tao
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5074 - 5078
  • [27] Genetic Algorithm-Based Adaptive Wiener Gain for Speech Enhancement Using an Iterative Posterior NMF
    Yechuri, Sivaramakrishna
    Vanabathina, Sunny Dayal
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2023, 23 (06)
  • [28] An Experimental Study on Speech Enhancement Based on Deep Neural Networks
    Xu, Yong
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (01) : 65 - 68
  • [29] A Regression Approach to Speech Enhancement Based on Deep Neural Networks
    Xu, Yong
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (01) : 7 - 19
  • [30] SPEECH ENHANCEMENT BASED ON DEEP NEURAL NETWORKS WITH SKIP CONNECTIONS
    Tu, Ming
    Zhang, Xianxian
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5565 - 5569