Speech enhancement method based on the perceptual joint optimization deep neural network

被引:0
|
作者
Yuan, Wenhao [1 ]
Lou, Yingxi [1 ]
Liang, Chunyan [1 ]
Wang, Zhiqiang [1 ]
机构
[1] College of Computer Science and Technology, Shandong University of Technology, Zibo,255000, China
关键词
Mean square error - Speech enhancement - Cost functions - Speech intelligibility;
D O I
10.19665/j.issn1001-2400.2019.02.015
中图分类号
学科分类号
摘要
In the training of speech enhancement models based on the deep neural network (DNN), the mean square error is generally adopted as the cost function, which is not optimized for the speech enhancement problem. In view of this problem, to consider the correlation between the adjacent frames of the network's output and the presence of the speech component in each time-frequency unit, by correlating the adjacent frames of the network's output and designing a perceptual coefficient related to the presence of the speech component in time-frequency units in the cost function, a speech enhancement method based on the joint optimization DNN is proposed. Experimental results show that compared with the speech enhancement method based on the mean square error, the proposed method significantly improves the quality and intelligibility of the enhanced speech and has a better speech enhancement performance. © 2019, The Editorial Board of Journal of Xidian University. All right reserved.
引用
收藏
页码:90 / 94
相关论文
共 50 条
  • [1] An optimization method for speech enhancement based on deep neural network
    Sun, Haixia
    Li, Sikun
    [J]. 3RD INTERNATIONAL CONFERENCE ON ADVANCES IN ENERGY, ENVIRONMENT AND CHEMICAL ENGINEERING, 2017, 69
  • [2] Joint Optimization of Perceptual Gain Function and Deep Neural Networks for Single-Channel Speech Enhancement
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Zhou, Xingyu
    Sun, Meng
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (02) : 714 - 717
  • [3] A Single-channel Speech Enhancement Approach Based on Perceptual Masking Deep Neural Network
    [J]. Zhang, Xiong-Wei (xwzhang9898@163.com), 2017, Science Press (43):
  • [4] Improved Sparse NMF based Speech Enhancement Method with Deep Neural Network
    Zou, Xia
    Zhang, Xiongwei
    Shi, Wenhua
    Wang, Fupeng
    Zhang, Jingtao
    Gao, Mingyue
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL FORUM ON MANAGEMENT, EDUCATION AND INFORMATION TECHNOLOGY APPLICATION (IFMEITA 2017), 2017, 130 : 231 - 234
  • [5] Speech Enhancement based on Deep Convolutional Neural Network
    Nuthakki, Ramesh
    Masanta, Payel
    Yukta, T. N.
    [J]. PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 770 - 775
  • [6] Supervised speech enhancement based on deep neural network
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Qazi, Abdul Baser
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5187 - 5201
  • [7] A Novel Single Channel Speech Enhancement Based on Joint Deep Neural Network and Wiener Filter
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Zhou, Xingyu
    [J]. PROCEEDINGS OF 2015 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATCS AND COMPUTING (IEEE PIC), 2015, : 163 - 167
  • [8] Monaural speech enhancement combining deep neural network and convex optimization
    ZHANG Xiaoyan
    ZHANG Tianqi
    GE Wanying
    BAI Yangliu
    [J]. Chinese Journal of Acoustics, 2021, 40 (03) : 460 - 476
  • [9] Speech enhancement based on noise classification and deep neural network
    Wang, Wenbo
    Liu, Houguang
    Yang, Jianhua
    Cao, Guohua
    Hua, Chunli
    [J]. MODERN PHYSICS LETTERS B, 2019, 33 (17):
  • [10] Speech Enhancement Method Based On LSTM Neural Network for Speech Recognition
    Liu, Ming
    Wang, Yujun
    Wang, Jin
    Wang, Jing
    Xie, Xiang
    [J]. PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 245 - 249