Speech Enhancement of Complex Convolutional Recurrent Network with Attention

被引:0
|
作者
Jiangjiao Zeng
Lidong Yang
机构
[1] Inner Mongolia University Of Science and Technology,School of Information Engineering
关键词
Speech enhancement; Parameter-free attention module; Convolutional recurrent network; Bidirectional gated recurrent unit;
D O I
暂无
中图分类号
学科分类号
摘要
Speech enhancement aims to separate pure speech from noisy speech, to improve speech quality and intelligibility. A complex convolutional recurrent network with a parameter-free attention module is proposed to improve the effect of speech enhancement. First, the feature information is enhanced by improving the convolutional layer of the encoding layer and the decoding layer. Then, the redundant information is suppressed by adding a parameter-free attention module to extract features that are more effective for the speech enhancement task, and the middle layer is selected for the bidirectional gated recurrent unit. Compared with the best of several baseline models, in the Voice Bank + DEMAND dataset, Perceptual Evaluation of Speech Quality (PESQ) increased by 0.17 (6.23%), MOS predictor of intrusiveness of background noise (CBAK) increased by 0.14 (4.34%), (MOS predictor of overall processed speech quality) COVL increased by 0.40 (12.42%), and (MOS predictor of speech distortion) CSIG index increased by 0.57 (15.28%). Experimental results show that the proposed approach has higher theoretical significance and practical value for actual speech enhancement.
引用
收藏
页码:1834 / 1847
页数:13
相关论文
共 50 条
  • [31] DCTCN:Deep Complex Temporal Convolutional Network for Long Time Speech Enhancement
    Ren, Jigang
    Mao, Qirong
    [J]. INTERSPEECH 2022, 2022, : 5478 - 5482
  • [32] RECURRENT CONVOLUTIONAL NEURAL NETWORK FOR SPEECH PROCESSING
    Zhao, Yue
    Jin, Xingyu
    Hu, Xiaolin
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5300 - 5304
  • [33] CONVOLUTIONAL-RECURRENT NEURAL NETWORKS FOR SPEECH ENHANCEMENT
    Zhao, Han
    Zarar, Shuayb
    Tashev, Ivan
    Lee, Chin-Hui
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2401 - 2405
  • [34] An optimized convolutional neural network for speech enhancement
    Karthik A.
    Mazher Iqbal J.L.
    [J]. International Journal of Speech Technology, 2023, 26 (04) : 1117 - 1129
  • [35] A Fully Convolutional Neural Network for Speech Enhancement
    Park, Se Rim
    Lee, Jin Won
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1993 - 1997
  • [36] Convolutional fusion network for monaural speech enhancement
    Xian, Yang
    Sun, Yang
    Wang, Wenwu
    Naqvi, Syed Mohsen
    [J]. NEURAL NETWORKS, 2021, 143 : 97 - 107
  • [37] MONAURAL SPEECH ENHANCEMENT WITH COMPLEX CONVOLUTIONAL BLOCK ATTENTION MODULE AND JOINT TIME FREQUENCY LOSSES
    Zhao, Shengkui
    Nguyen, Trung Hieu
    Ma, Bin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6648 - 6652
  • [38] NSE-CATNet: Deep Neural Speech Enhancement Using Convolutional Attention Transformer Network
    Saleem, Nasir
    Gunawan, Teddy Surya
    Kartiwi, Mira
    Nugroho, Bambang Setia
    Wijayanto, Inung
    [J]. IEEE ACCESS, 2023, 11 : 66979 - 66994
  • [39] Convolutional Recurrent Smart Speech Enhancement Architecture for Hearing Aids
    Nossier, Soha A.
    Wall, Julie
    Moniri, Mansour
    Glackin, Cornelius
    Cannings, Nigel
    [J]. INTERSPEECH 2022, 2022, : 5428 - 5432
  • [40] RESIDUAL RECURRENT NEURAL NETWORK FOR SPEECH ENHANCEMENT
    Abdulbaqi, Jalal
    Gu, Yue
    Chen, Shuhong
    Marsic, Ivan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6659 - 6663