A Perceptually Motivated Approach for Speech Enhancement Based on Deep Neural Network

被引:2
|
作者
Han, Wei [1 ]
Zhang, Xiongwei [1 ]
Min, Gang [1 ,2 ]
Sun, Meng [1 ]
机构
[1] PLA Univ Sci & Technol, Lab Intelligence Informat Proc, Nanjing, Jiangsu, Peoples R China
[2] XIAN Commun Inst, Xian, Peoples R China
关键词
perceptually motivated; deep neural network; speech enhancement; masking residual noise; SEPARATION;
D O I
10.1587/transfun.E99.A.835
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this letter, a novel perceptually motivated single channel speech enhancement approach based on Deep Neural Network (DNN) is presented. Taking into account the good masking properties of the human auditory system, a new DNN architecture is proposed to reduce the perceptual effect of the residual noise. This new DNN architecture is directly trained to learn a gain function which is used to estimate the power spectrum of clean speech and shape the spectrum of the residual noise at the same time. Experimental results demonstrate that the proposed perceptually motivated speech enhancement approach could achieve better objective speech quality when tested with TIMIT sentences corrupted by various types of noise, no matter whether the noise conditions are included in the training set or not.
引用
收藏
页码:835 / 838
页数:4
相关论文
共 50 条
  • [1] A perceptually motivated approach for speech enhancement
    Hu, Y
    Loizou, PC
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 457 - 465
  • [2] A Perceptually Motivated Estimator for Speech Enhancement
    Montazeri, Vahid
    Khoubrouy, Soudeh A.
    Panahi, Issa M. S.
    [J]. 2013 8TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA), 2013, : 366 - 370
  • [3] A new perceptually weighted cost function in deep neural network based speech enhancement systems
    Goli, Peyman
    [J]. HEARING BALANCE AND COMMUNICATION, 2019, 17 (03) : 191 - 196
  • [4] Speech enhancement based on perceptually motivated guided spectrogram filtering
    Wang, Jie
    Yan, Linhuang
    Yang, Qiaohe
    Yuan, Minmin
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 5443 - 5454
  • [5] Speech enhancement based on perceptually motivated Bayesian estimators of the magnitude spectrum
    Loizou, PC
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 857 - 869
  • [6] Perceptually motivated deep neural network for video compression artifact removal
    Ramsook, Darren
    Kokaram, Anil
    Birkbeck, Neil
    Su, Yeping
    Adsumilli, Balu
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XLV, 2022, 12226
  • [7] PERCEPTUALLY GUIDED SPEECH ENHANCEMENT USING DEEP NEURAL NETWORKS
    Zhao, Yan
    Xu, Buye
    Giri, Ritwik
    Zhang, Tao
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5074 - 5078
  • [8] β -ORDER PERCEPTUALLY MOTIVATED MMSE ESTIMATION FOR SPEECH ENHANCEMENT
    Wang, Yue
    Cui, Jie
    Li, Ping
    Xiao, Ling
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT, PROCEEDINGS, 2009, : 717 - 721
  • [9] Perceptually Motivated Generalized Spectral Subtraction for Speech Enhancement
    Zoghlami, Novlene
    Lachiri, Zied
    Ellouze, Noureddine
    [J]. ADVANCES IN NONLINEAR SPEECH PROCESSING, 2010, 5933 : 136 - 143
  • [10] Speech Enhancement based on Deep Convolutional Neural Network
    Nuthakki, Ramesh
    Masanta, Payel
    Yukta, T. N.
    [J]. PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 770 - 775