Perceptual Weighting Deep Neural Networks for Single-channel Speech Enhancement

被引:0
|
作者
Han, Wei [1 ]
Zhang, Xiongwei [1 ]
Min, Gang [1 ,2 ]
Zhou, Xingyu [1 ]
Zhang, Wei [3 ]
机构
[1] PLA Univ Sci & Technol, Nanjing 210007, Jiangsu, Peoples R China
[2] XIAN Commun Inst, Xian 710106, Peoples R China
[3] China Satellite Maritime Tracking & Controlling D, Jiangyin 214400, Peoples R China
关键词
AUTO-ENCODER; LOW-RANK; NOISE; SPARSE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Improving the perceptual quality of speech signals is a key yet challenging problem for many real world applications. In this paper, we propose a perceptually motivated approach based on deep neural networks (DNNs) for speech enhancement task. The proposed approach take into consider the masking properties of the human auditory system and reduces the perceptual effect of the residual noise. Given the good performance of deep learning in signal representation, a DNN is employed for accurately modeling the clean speech spectrum. In the DNN training stage, perceptual weighting matrix is used to adjust the weight of the error when using back propagation algorithm transfer the error from DNN output layer to the front layer. Evaluations on various real-world background noises show that the proposed method performs better than different competitive methods.
引用
收藏
页码:446 / 450
页数:5
相关论文
共 50 条
  • [1] Joint Optimization of Perceptual Gain Function and Deep Neural Networks for Single-Channel Speech Enhancement
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Zhou, Xingyu
    Sun, Meng
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (02) : 714 - 717
  • [2] A Single-channel Speech Enhancement Approach Based on Perceptual Masking Deep Neural Network
    [J]. Zhang, Xiong-Wei (xwzhang9898@163.com), 2017, Science Press (43):
  • [3] Deep Neural Network for Supervised Single-Channel Speech Enhancement
    Saleem, Nasir
    Irfan Khattak, Muhammad
    Ali, Muhammad Yousaf
    Shafi, Muhammad
    [J]. ARCHIVES OF ACOUSTICS, 2019, 44 (01) : 3 - 12
  • [4] JOINT OPTIMIZATION OF AUDIBLE NOISE SUPPRESSION AND DEEP NEURAL NETWORKS FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Sun, Meng
    Yang, Jibin
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [5] SINGLE-CHANNEL MIXED SPEECH RECOGNITION USING DEEP NEURAL NETWORKS
    Weng, Chao
    Yu, Dong
    Seltzer, Michael L.
    Droppo, Jasha
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [6] Kernel Machines Beat Deep Neural Networks on Mask-based Single-channel Speech Enhancement
    Hui, Like
    Ma, Siyuan
    Belkin, Mikhail
    [J]. INTERSPEECH 2019, 2019, : 2748 - 2752
  • [7] GAUSSIAN DENSITY GUIDED DEEP NEURAL NETWORK FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Chai, Li
    Du, Jun
    Wang, Yan-nan
    [J]. 2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
  • [8] NOISE-ADAPTIVE DEEP NEURAL NETWORK FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Chung, Hanwook
    Kim, Taesup
    Plourde, Eric
    Champagne, Benoit
    [J]. 2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [9] Single channel speech enhancement based on perceptual frequency-weighting
    Hayashi, Seiji
    Suguimoto, Masahiro
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (06) : 998 - 1001
  • [10] Deep Neural Networks for Single-Channel Multi-Talker Speech Recognition
    Weng, Chao
    Yu, Dong
    Seltzer, Michael L.
    Droppo, Jasha
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (10) : 1670 - 1679