Prediction of NMF-based Wiener Filter for Speech Enhancement Using Deep Neural Networks

Cited by: 0
Authors
Bai, Zhigang [1 ]
Bao, Changchun [1 ]
Cui, Zihao [1 ]
Affiliations
[1] Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing 100124, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
speech enhancement; nonnegative matrix factorization; deep neural networks; NMF-based Wiener filter;
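DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract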
This paper presents a novel approach to nonnegative matrix factorization (NMF) based speech enhancement in which a deep neural network (DNN) predicts a training target called the NMF-based Wiener filter. As a masking-based target, the NMF-based Wiener filter is easier to estimate than the encoding vectors used as targets in previous algorithms. Predicting the filter directly also reduces the intermediate error of the NMF-based enhancement process. The encoding vectors of noisy speech are extracted with the NMF algorithm and normalized to obtain more discriminative input features. The DNN is trained to learn a nonlinear mapping from the encoding vectors of noisy speech to the NMF-based Wiener filter. At the test stage, the predicted NMF-based Wiener filter is used to enhance the noisy speech. Objective evaluations demonstrate that the proposed algorithm outperforms some existing NMF-based and DNN-based methods at various input signal-to-noise ratio (SNR) levels.
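The abstract does not give the formulas, so the following is only a minimal NumPy sketch of how an NMF-based Wiener filter is commonly formed, not the authors' implementation. It assumes pre-trained speech and noise bases (`W_s`, `W_n`, random here), KL-divergence multiplicative updates for the encoding vectors, additive magnitude spectra, and per-frame sum normalization of the features; all function and variable names are hypothetical. In the approach described above, a DNN would predict `mask` from `features` rather than computing it from the NMF activations as done here.

```python
import numpy as np

def nmf_encode(V, W, n_iter=200, eps=1e-12, seed=0):
    """Estimate nonnegative encoding vectors H with W fixed, so that V ~ W @ H.

    Uses the standard multiplicative update for the KL divergence (an assumption).
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    col_sums = W.sum(axis=0)[:, None]  # W^T 1, broadcast over frames
    for _ in range(n_iter):
        WH = W @ H + eps
        H *= (W.T @ (V / WH)) / (col_sums + eps)
    return H

def nmf_wiener_filter(W_s, W_n, H_s, H_n, eps=1e-12):
    """Masking-based target: speech estimate over speech-plus-noise estimate."""
    S_hat = W_s @ H_s
    N_hat = W_n @ H_n
    return S_hat / (S_hat + N_hat + eps)

# Synthetic stand-ins for magnitude spectrograms (F bins x T frames).
rng = np.random.default_rng(0)
F, T, K = 64, 20, 8
W_s = rng.random((F, K))  # hypothetical pre-trained speech basis
W_n = rng.random((F, K))  # hypothetical pre-trained noise basis
V = W_s @ rng.random((K, T)) + W_n @ rng.random((K, T))  # "noisy speech"

# Encode the noisy speech on the concatenated basis, then split the activations.
H = nmf_encode(V, np.hstack([W_s, W_n]))
H_s, H_n = H[:K], H[K:]

# Normalized encoding vectors as input features (normalization scheme assumed).
features = H / (H.sum(axis=0, keepdims=True) + 1e-12)

# Filter (here computed directly from the NMF estimates) applied to the noisy magnitudes.
mask = nmf_wiener_filter(W_s, W_n, H_s, H_n)
enhanced = mask * V
```

Because the mask is bounded between 0 and 1 for every time-frequency bin, it is a better-conditioned regression target than unbounded activation values, which is consistent with the abstract's claim that the filter is the easier target.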
Pages: 5
Related papers
50 in total
  • [31] Target Speech Signal Enhancement Based on Deep Neural Networks
    Zhang, Xin
    Wang, MingJiang
    Xuan, XiaoGuang
    Sun, FengJiao
    2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 241 - 245
  • [32] A Novel Approach to Speech Enhancement Based on Deep Neural Networks
    Salehi, Maryam
    Mirzakuchaki, Sattar
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2022, 22 (02) : 71 - 78
  • [33] DCT and Wiener filter based on approach for speech enhancement using a single microphone
    Laboratory of Information Science, College of Communication Engineering, Jilin University, Changchun 130012, China
    Tongxin Xuebao, 2006, 10 (86-93):
  • [34] ADAPTIVE WIENER FILTER FOR SPEECH ENHANCEMENT
    Yelwande, Aishwarya
    Kansal, Sarita
    Dixit, Ansha
    2017 IEEE INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION, INSTRUMENTATION AND CONTROL (ICICIC), 2017,
  • [35] Speech enhancement with an adaptive Wiener filter
    Abd El-Fattah, Marwa
    Dessouky, Moawad
    Abbas, Alaa
    Diab, Salaheldin
    El-Rabaie, El-Sayed
    Al-Nuaimy, Waleed
    Alshebeili, Saleh
    Abd El-Samie, Fathi
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) : 53 - 64
  • [36] ROTATIONAL RESET STRATEGY FOR ONLINE SEMI-SUPERVISED NMF-BASED SPEECH ENHANCEMENT FOR LONG RECORDINGS
    Zhou, Jun
    Chen, Shuo
    Duan, Zhiyao
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [37] SPEECH ENHANCEMENT USING IMPROVED MAP ESTIMATION AND WIENER FILTER
    Chehrehsa, Sarang
    Moir, Tom
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2016, : 494 - 498
  • [38] COMBINING SPARSE NMF WITH DEEP NEURAL NETWORK: A NEW CLASSIFICATION-BASED APPROACH FOR SPEECH ENHANCEMENT
    Tseng, Hung-Wei
    Hong, Mingyi
    Luo, Zhi-Quan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2145 - 2149
  • [39] Audio-Visual Speech Enhancement using Deep Neural Networks
    Hou, Jen-Cheng
    Wang, Syu-Siang
    Lai, Ying-Hui
    Lin, Jen-Chun
    Tsao, Yu
    Chang, Hsiu-Wen
    Wang, Hsin-Min
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [40] Speech Enhancement for Speaker Recognition Using Deep Recurrent Neural Networks
    Tkachenko, Maxim
    Yamshinin, Alexander
    Lyubimov, Nikolay
    Kotov, Mikhail
    Nastasenko, Marina
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 690 - 699