REDUNDANT CONVOLUTIONAL NETWORK WITH ATTENTION MECHANISM FOR MONAURAL SPEECH ENHANCEMENT

被引:0
|
作者
Lan, Tian [1 ,2 ]
Lyu, Yilan [1 ]
Hui, Guoqiang [1 ]
Mokhosi, Refuoe [1 ]
Li, Sen [1 ]
Liu, Qiao [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Sichuan, Peoples R China
[2] CETC Big Data Res Inst Co Ltd, Guiyang, Peoples R China
基金
美国国家科学基金会;
关键词
Squeeze-and-Excitation; convolutional encoder-decoder; speech enhancement; fully convolutional network; attention mechanism; NEURAL-NETWORK;
D O I
10.1109/icassp40776.2020.9053277
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The redundant convolutional encoder-decoder network has proven useful in speech enhancement tasks. It can capture localized time-frequency details of speech signals through both the fully convolutional network structure and feature selection capability resulting from the encoder-decoder mechanism. However, it does not explicitly consider the signal filtering mechanism, which we regard as important for speech enhancement models. In this study, we introduce an attention mechanism into the convolutional encoder-decoder model. This mechanism adaptively filters channel-wise feature responses by explicitly modeling attentions (on speech versus noise signals) between channels. Experimental results show that the proposed attention model is effective in capturing speech signals from background noise, and performs especially better in unseen noise conditions compared to other state-of-the-art models.
引用
收藏
页码:6654 / 6658
页数:5
相关论文
共 50 条
  • [1] An Attention-augmented Fully Convolutional Neural Network for Monaural Speech Enhancement
    Xu, Zezheng
    Jiang, Ting
    Li, Chao
    Yu, Jiacheng
    [J]. 2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [2] Convolutional fusion network for monaural speech enhancement
    Xian, Yang
    Sun, Yang
    Wang, Wenwu
    Naqvi, Syed Mohsen
    [J]. NEURAL NETWORKS, 2021, 143 : 97 - 107
  • [3] Combining Multi-Perspective Attention Mechanism With Convolutional Networks for Monaural Speech Enhancement
    Lan, Tian
    Lyu, Yilan
    Ye, Wenzheng
    Hui, Guoqiang
    Xu, Zenglin
    Liu, Qiao
    [J]. IEEE ACCESS, 2020, 8 : 78979 - 78991
  • [4] Supervised Attention Multi-Scale Temporal Convolutional Network for monaural speech enhancement
    Zhang, Zehua
    Zhang, Lu
    Zhuang, Xuyi
    Qian, Yukun
    Wang, Mingjiang
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01)
  • [5] A Recursive Network with Dynamic Attention for Monaural Speech Enhancement
    Li, Andong
    Zheng, Chengshi
    Fan, Cunhang
    Peng, Renhua
    Li, Xiaodong
    [J]. INTERSPEECH 2020, 2020, : 2422 - 2426
  • [6] Dilated convolutional recurrent neural network for monaural speech enhancement
    Pirhosseinloo, Shadi
    Brumberg, Jonathan S.
    [J]. CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 158 - 162
  • [7] Multi-stage attention network for monaural speech enhancement
    Wang, Kunpeng
    Lu, Wenjing
    Liu, Peng
    Yao, Juan
    Li, Huafeng
    [J]. IET SIGNAL PROCESSING, 2023, 17 (03)
  • [8] Harmonic Attention for Monaural Speech Enhancement
    Wang, Tianrui
    Zhu, Weibin
    Gao, Yingying
    Zhang, Shilei
    Feng, Junlan
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2424 - 2436
  • [9] COMPLEX SPECTRAL MAPPING WITH A CONVOLUTIONAL RECURRENT NETWORK FOR MONAURAL SPEECH ENHANCEMENT
    Tan, Ke
    Wang, DeLiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6865 - 6869
  • [10] Embedding Encoder-Decoder With Attention Mechanism for Monaural Speech Enhancement
    Lan, Tian
    Ye, Wenzheng
    Lyu, Yilan
    Zhang, Junyi
    Liu, Qiao
    [J]. IEEE ACCESS, 2020, 8 : 96677 - 96685