An End-to-End Speech Enhancement Method Combining Attention Mechanism to Improve GAN

被引:0
|
作者
Chen, Wei [1 ]
Cai, Yichao [1 ]
Yang, Qingyu [1 ]
Wang, Ge [1 ]
Liu, Taian [1 ]
Liu, Xinying [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An, Peoples R China
关键词
Generative Adversarial Networks; time series; attention mechanisms; SEGAN; PESQ; STOI; NOISE; SUPPRESSION; NETWORKS;
D O I
10.1109/IAEAC54830.2022.9929534
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current Generative Adversarial Networks only rely on convolution operations when dealing with speech tasks, ignoring the dependencies between time series and have limited learning ability so that there is still obvious residual noise in the enhanced speech. To solve this problem, an end-to-end speech enhancement method combining attention mechanisms to improve GAN is proposed to apply a combined attention mechanism fusing channel and space between convolutional layers of SEGAN to obtain more contextual information of speech in both channel and space dimensions and extract more accurate feature information. Experimental results demonstrate that the method outperforms the baseline model in both speech quality and intelligibility. The experimental data show that under different signal-to-noise ratios, the perceptual speech quality assessment (PESQ) is improved by an average of 25.72%, and the objective short-term object intelligibility (STOI) is improved by an average of 1.68%.
引用
收藏
页码:538 / 542
页数:5
相关论文
共 50 条
  • [1] An End-to-end Speech Recognition Algorithm based on Attention Mechanism
    Chen, Jia-nan
    Gao, Shuang
    Sun, Han-zhe
    Liu, Xiao-hui
    Wang, Zi-ning
    Zheng, Yan
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 2935 - 2940
  • [2] Application of improved U-Net network with attention mechanism in end-to-end speech enhancement
    WU Ruiqin
    CHEN Xueqin
    YU Jie
    WANG Lirong
    ZHAO Heming
    Chinese Journal of Acoustics, 2022, 41 (04) : 390 - 403
  • [3] Application of improved U-Net network with attention mechanism in end-to-end speech enhancement
    Wu, Ruiqin
    Chen, Xueqin
    Yu, Jie
    Wang, Lirong
    Zhao, Heming
    Shengxue Xuebao/Acta Acustica, 2022, 47 (02): : 266 - 275
  • [4] An End-to-End Formula Recognition Method Integrated Attention Mechanism
    Zhou, Mingle
    Cai, Ming
    Li, Gang
    Li, Min
    MATHEMATICS, 2023, 11 (01)
  • [5] TRIGGERED ATTENTION FOR END-TO-END SPEECH RECOGNITION
    Moritz, Niko
    Hori, Takaaki
    Le Roux, Jonathan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5666 - 5670
  • [6] SPEECH ENHANCEMENT USING END-TO-END SPEECH RECOGNITION OBJECTIVES
    Subramanian, Aswin Shanmugam
    Wang, Xiaofei
    Baskar, Murali Karthick
    Watanabe, Shinji
    Taniguchi, Toru
    Tran, Dung
    Fujita, Yuya
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 234 - 238
  • [7] Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition
    Wang, Xiong
    Sun, Sining
    Xie, Lei
    Ma, Long
    INTERSPEECH 2021, 2021, : 4578 - 4582
  • [8] Memory Attention: Robust Alignment Using Gating Mechanism for End-to-End Speech Synthesis
    Lee, Joun Yeop
    Cheon, Sung Jun
    Choi, Byoung Jin
    Kim, Nam Soo
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 2004 - 2008
  • [9] RefineNet-based End-to-end Speech Enhancement
    Lan T.
    Peng C.
    Li S.
    Qian Y.-X.
    Chen C.
    Liu Q.
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (02): : 554 - 563
  • [10] End-to-End Mandarin Speech Recognition Combining CNN and BLSTM
    Wang, Dong
    Wang, Xiaodong
    Lv, Shaohe
    SYMMETRY-BASEL, 2019, 11 (05):