Multi-stage Progressive Speech Enhancement Network

被引:5
|
作者
Xu, Xinmeng [1 ,2 ]
Wang, Yang [1 ]
Xu, Dongxiang [1 ]
Peng, Yiyuan [1 ]
Zhang, Cong [1 ]
Jia, Jie [1 ]
Chen, Binbin [1 ]
机构
[1] Vivo AI Lab, Shenzhen, Peoples R China
[2] Trinity Coll Dublin, EE Engn, Dublin, Ireland
来源
关键词
speech enhancement; encoder-decoder convolutional network; channel attention; supervised attention; crossstage feature fusion; multi-stage progressive network; NEURAL-NETWORK; SEPARATION; NOISE;
D O I
10.21437/Interspeech.2021-520
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Speech enhancement is a fundamental way to separate and generate clean speech from adverse environment where the received speech is seriously corrupted by noise. This paper applies a novel progressive network for speech enhancement by using multi-stage structure, where each stage contains a channel attention block followed by dilated encoder-decoder convolutional network with gated linear units. In addition, each stage generates a prediction that is refined by a supervised attention block. What is more, a fusion block is inserted between original inputs and outputs of previous stage. Multi-stage architecture is introduced to sequentially invoke multiple deep-learning networks, and its key ingredient is the information exchange between different stages. Thus, a more flexible and robust outputs can be generated. Experimental results show that the proposed architecture obtains consistently better performance than recent state-of-the-art models in terms of both PESQ and STOI scores.
引用
收藏
页码:2691 / 2695
页数:5
相关论文
共 50 条
  • [1] Multi-stage attention network for monaural speech enhancement
    Wang, Kunpeng
    Lu, Wenjing
    Liu, Peng
    Yao, Juan
    Li, Huafeng
    [J]. IET SIGNAL PROCESSING, 2023, 17 (03)
  • [2] Multi-Stage Speech Enhancement for Automatic Speech Recognition
    Lee, Seungyeol
    Lee, Youngwoo
    Cho, Namgook
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2016,
  • [3] Multi-stage strength estimation network with cross attention for single channel speech enhancement
    Zhang, Zipeng
    Ding, Yuchen
    Chen, Wei
    Chen, Yutao
    Guo, Weiwei
    Liu, Houguang
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (10) : 6937 - 6948
  • [4] MSPNet: Multi-stage progressive network for image denoising
    Bai, Yu
    Liu, Meiqin
    Yao, Chao
    Lin, Chunyu
    Zhao, Yao
    [J]. NEUROCOMPUTING, 2023, 517 : 71 - 80
  • [5] Recovery for underwater image degradation with multi-stage progressive enhancement
    Liu, Junnan
    Liu, Zhilin
    Wei, Yanhui
    Ouyang, Wenjia
    [J]. OPTICS EXPRESS, 2022, 30 (07) : 11704 - 11725
  • [6] Speech enhancement using microphone array with multi-stage processing
    Cao, YC
    Sridharan, S
    Moody, M
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1996, E79A (03) : 386 - 394
  • [7] An Analysis of Multi-stage Progressive Image Restoration Network (MPRNet)
    Rajaei, Boshra
    Rajaei, Sara
    Damavandi, Hossein
    [J]. IMAGE PROCESSING ON LINE, 2023, 13 : 140 - 152
  • [8] Multi-stage coarse-to-fine progressive enhancement network for single-image HDR reconstruction
    Zhang, Wei
    Jiang, Gangyi
    Chen, Yeyao
    Xu, Haiyong
    Jiang, Hao
    Yu, Mei
    [J]. Displays, 2024, 84
  • [9] Exploring Multi-Stage GAN with Self-Attention for Speech Enhancement
    Asiedu Asante, Bismark Kweku
    Broni-Bediako, Clifford
    Imamura, Hiroki
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (16):
  • [10] Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
    Rathod, Jash
    Dawalatabad, Nauman
    Singh, Shatrughan
    Gowda, Dhananjaya
    [J]. INTERSPEECH 2022, 2022, : 1691 - 1695