Speech enhancement method based on the perceptual joint optimization deep neural network

Cited by: 0
Authors
Yuan W. [1]
Lou Y. [1]
Liang C. [1]
Wang Z. [1]
Affiliation
[1] College of Computer Science and Technology, Shandong University of Technology, Zibo
Keywords
Correlation; Cost function; Deep neural network; Speech enhancement
DOI
10.19665/j.issn1001-2400.2019.02.015
Abstract
Speech enhancement models based on deep neural networks (DNNs) are generally trained with the mean square error as the cost function, which is not optimized for the speech enhancement problem. To address this, a speech enhancement method based on a jointly optimized DNN is proposed. Its cost function accounts for two factors: the correlation between adjacent frames of the network's output, and the presence of the speech component in each time-frequency unit. Adjacent output frames are correlated in the cost function, and a perceptual coefficient related to the presence of the speech component in each time-frequency unit is introduced. Experimental results show that, compared with the speech enhancement method based on the mean square error, the proposed method significantly improves the quality and intelligibility of the enhanced speech and achieves better overall enhancement performance. © 2019, The Editorial Board of Journal of Xidian University. All rights reserved.
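The abstract does not state the cost function itself, so the following is only an illustrative sketch of the general idea, not the authors' formulation. It assumes a ratio-mask-like speech presence value as the perceptual coefficient and a first-order frame difference as the adjacent-frame term; the function name `perceptual_joint_loss`, the weight `alpha`, and the use of a separate noise spectrum are all hypothetical. Plain Python lists are used so the example has no dependencies.

```python
def perceptual_joint_loss(est, ref, noise, alpha=0.5, eps=1e-8):
    """Hypothetical cost over (frames x bins) magnitude spectra as nested lists.

    est   : network output (enhanced speech estimate)
    ref   : clean speech target
    noise : noise spectrum, used only to derive the assumed presence weight
    alpha : assumed weight of the adjacent-frame term
    """
    n_frames, n_bins = len(ref), len(ref[0])

    # Term 1: squared error, weighted up where speech is present.
    static_sum = 0.0
    for t in range(n_frames):
        for k in range(n_bins):
            # Assumed perceptual coefficient: speech presence in [0, 1].
            presence = ref[t][k] ** 2 / (ref[t][k] ** 2 + noise[t][k] ** 2 + eps)
            static_sum += (1.0 + presence) * (est[t][k] - ref[t][k]) ** 2
    static_term = static_sum / (n_frames * n_bins)

    # Term 2: error on frame-to-frame differences, which ties each
    # output frame to its neighbour (assumed form of the correlation term).
    delta_sum = 0.0
    for t in range(1, n_frames):
        for k in range(n_bins):
            d_est = est[t][k] - est[t - 1][k]
            d_ref = ref[t][k] - ref[t - 1][k]
            delta_sum += (d_est - d_ref) ** 2
    delta_term = delta_sum / max((n_frames - 1) * n_bins, 1)

    return static_term + alpha * delta_term
```

With `alpha=0` and the presence weight removed, this reduces to the plain mean square error that the abstract takes as its baseline.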
Pages: 90-94
Page count: 4
Related papers
50 in total
  • [41] A Classification Retrieval Method for Encrypted Speech Based on Deep Neural Network and Deep Hashing
    Zhang, Qiuyu
    Zhao, Xuejiao
    Hu, Yingjie
    [J]. IEEE ACCESS, 2020, 8 : 202469 - 202482
  • [42] Deep Convolutional Neural Network-based Speech Signal Enhancement Using Extensive Speech Features
    Garg, Anil
    Sahu, O. P.
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL METHODS, 2022, 19 (08)
  • [43] Convolutional Deep Neural Network and Full Connectivity for Speech Enhancement
    Alameri, Ban M.
    Kadhim, Inas Jawad
    Hadi, Suha Qasim
    Hassoon, Ali F.
    Abd, Mustafa M.
    Premaratne, Prashan
    [J]. INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (04) : 140 - 154
  • [44] Speech Enhancement for Optical Laser Microphone With Deep Neural Network
    Cai, Chengkai
    Iwai, Kenta
    Nishiura, Takanobu
    Yamashita, Yoichi
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 449 - 454
  • [45] Deep neural network-based linear predictive parameter estimations for speech enhancement
    Li, Yaxing
    Kang, Sangwon
    [J]. IET SIGNAL PROCESSING, 2017, 11 (04) : 469 - 476
  • [46] A Reduced Complexity MFCC-based Deep Neural Network Approach for Speech Enhancement
    Razani, Ryan
    Chung, Hanwook
    Attabi, Yazid
    Champagne, Benoit
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2017, : 331 - 336
  • [47] Cross-language Transfer Learning for Deep Neural Network Based Speech Enhancement
    Xu, Yong
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 336 - +
  • [48] Optimization of Deep Neural Network (DNN) Speech Coder Using a Multi Time Scale Perceptual Loss Function
    Byun, Joon
    Shin, Seungmin
    Sung, Jongmo
    Beack, Seungkwon
    Park, Youngcheol
    [J]. INTERSPEECH 2022, 2022, : 4411 - 4415
  • [49] An Improved Supervised Speech Separation Method Based on Perceptual Weighted Deep Recurrent Neural Networks
    Han, Wei
    Zhang, Xiongwei
    Sun, Meng
    Li, Li
    Shi, Wenhua
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (02) : 718 - 721
  • [50] Speech enhancement from fused features based on deep neural network and gated recurrent unit network
    Wang, Youming
    Han, Jiali
    Zhang, Tianqi
    Qing, Didi
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2021, 2021 (01)