Speech Enhancement of Complex Convolutional Recurrent Network with Attention

Cited by: 0
Authors
Jiangjiao Zeng
Lidong Yang
Affiliation
[1] Inner Mongolia University of Science and Technology, School of Information Engineering
Keywords
Speech enhancement; Parameter-free attention module; Convolutional recurrent network; Bidirectional gated recurrent unit
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Speech enhancement aims to separate clean speech from noisy speech in order to improve speech quality and intelligibility. This paper proposes a complex convolutional recurrent network with a parameter-free attention module to improve enhancement performance. First, feature information is strengthened by modifying the convolutional layers of the encoder and decoder. Then, a parameter-free attention module is added to suppress redundant information and extract features that are more useful for the speech enhancement task, and a bidirectional gated recurrent unit is used as the middle layer. On the Voice Bank + DEMAND dataset, compared with the best of several baseline models, Perceptual Evaluation of Speech Quality (PESQ) increased by 0.17 (6.23%), the MOS predictor of background-noise intrusiveness (CBAK) increased by 0.14 (4.34%), the MOS predictor of overall processed speech quality (COVL) increased by 0.40 (12.42%), and the MOS predictor of speech distortion (CSIG) increased by 0.57 (15.28%). The experimental results indicate that the proposed approach has both theoretical significance and practical value for real-world speech enhancement.
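The abstract does not say which parameter-free attention module is used or how it is wired into the network. As an illustration only, the sketch below assumes a SimAM-style energy-based weighting, one well-known attention scheme with no learnable parameters, applied to a single real-valued 2-D feature map. The function name `simam_style_attention` and the regularizer `lam` are hypothetical choices for this example, not details taken from the paper, and the complex-valued handling of the proposed network is not modeled.

```python
import math


def simam_style_attention(channel, lam=1e-4):
    """Reweight one 2-D feature map (a list of rows) with no learnable parameters.

    Assumption: SimAM-style weighting, where elements far from the channel
    mean receive a larger sigmoid gate. The map needs at least two elements.
    """
    values = [v for row in channel for v in row]
    n = len(values) - 1                       # normalizer used in the SimAM formulation
    mean = sum(values) / len(values)
    sq_dev = [[(v - mean) ** 2 for v in row] for row in channel]
    variance = sum(d for row in sq_dev for d in row) / n

    out = []
    for row, dev_row in zip(channel, sq_dev):
        out_row = []
        for v, d in zip(row, dev_row):
            inv_energy = d / (4.0 * (variance + lam)) + 0.5
            gate = 1.0 / (1.0 + math.exp(-inv_energy))   # sigmoid gate in (0, 1)
            out_row.append(v * gate)
        out.append(out_row)
    return out


# Hypothetical feature map: one element stands out from the rest.
feature_map = [[1.0, 1.0], [1.0, 5.0]]
weighted = simam_style_attention(feature_map)
```

In this toy map the outlying element keeps a larger fraction of its value than the others, which is the sense in which such a module can emphasize distinctive features and damp redundant ones without adding trainable weights.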
Pages: 1834-1847
Page count: 13
Related papers
50 in total
  • [1] Speech Enhancement of Complex Convolutional Recurrent Network with Attention
    Zeng, Jiangjiao
    Yang, Lidong
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 42 (3) : 1834 - 1847
  • [2] COMPLEX SPECTRAL MAPPING WITH A CONVOLUTIONAL RECURRENT NETWORK FOR MONAURAL SPEECH ENHANCEMENT
    Tan, Ke
    Wang, DeLiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6865 - 6869
  • [3] A Convolutional Gated Recurrent Network for Speech Enhancement
    Yuan, Wen-Hao
    Hu, Shao-Dong
    Shi, Yun-Long
    Li, Zhao
    Liang, Chun-Yan
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (07) : 1276 - 1283
  • [4] DEEP COMPLEX CONVOLUTIONAL RECURRENT NETWORK FOR MULTI-CHANNEL SPEECH ENHANCEMENT AND DEREVERBERATION
    Gelderblom, Femke B.
    Myrvoll, Tor Andre
    [J]. 2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [5] REDUNDANT CONVOLUTIONAL NETWORK WITH ATTENTION MECHANISM FOR MONAURAL SPEECH ENHANCEMENT
    Lan, Tian
    Lyu, Yilan
    Hui, Guoqiang
    Mokhosi, Refuoe
    Li, Sen
    Liu, Qiao
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6654 - 6658
  • [6] Dilated convolutional recurrent neural network for monaural speech enhancement
    Pirhosseinloo, Shadi
    Brumberg, Jonathan S.
    [J]. CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 158 - 162
  • [7] Environment-dependent Attention-driven Recurrent Convolutional Neural Network for Robust Speech Enhancement
    Ge, Meng
    Wang, Longbiao
    Li, Nan
    Shi, Hao
    Dang, Jianwu
    Li, Xiangang
    [J]. INTERSPEECH 2019, 2019, : 3153 - 3157
  • [8] Temporal Convolutional Network with Frequency Dimension Adaptive Attention for Speech Enhancement
    Zhang, Qiquan
    Song, Qi
    Nicolson, Aaron
    Lan, Tian
    Li, Haizhou
    [J]. INTERSPEECH 2021, 2021, : 166 - 170
  • [9] Scale-aware dual-branch complex convolutional recurrent network for monaural speech enhancement
    Li, Yihao
    Sun, Meng
    Zhang, Xiongwei
    Van Hamme, Hugo
    [J]. COMPUTER SPEECH AND LANGUAGE, 2024, 86
  • [10] Complex-valued temporal convolutional network for speech enhancement
    Song, Jiaqi
    Zou, Lian
    Zhou, Liqing
    Liu, Ziao
    Fan, Cien
    Wang, Bin
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2024, 22 (05)