SESNet: A Speech Enhancement and Separation Network in Noisy Reverberant Environments

被引:0
|
作者
Wang, Liusong [1 ,2 ]
Gao, Yuan [1 ,2 ]
Cao, Kaimin [1 ,2 ]
Hu, Ying [1 ,2 ]
机构
[1] Xinjiang Univ, Sch Comp Sci & Technol, Urumqi, Peoples R China
[2] Key Lab Signal Detect & Proc Xinjiang, Urumqi, Peoples R China
关键词
Speech enhancement; Speech separation; Noisy reverberant environment; Former block;
D O I
10.1007/978-981-96-1045-7_4
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech enhancement and separation in noisy reverberant environments are very challenging tasks. In this paper, we propose a speech enhancement and separation network, SESNet, for speech enhancement or speech separation in noisy reverberant environments, which is a multi-scale encoder-decoder architecture including a global-local feature extractor (GLFE). We also explored four kinds of Former blocks to be equipped in GLFE. We evaluate the performance of speech enhancement and speech separation on the VoiceBank+DEMAND and the WHAMR! datasets. The experimental results show that the SESNet has excellent performance for single- and multi-channel speech enhancement, and single-channel multi-speaker speech separation, keeping with a small model size.
引用
收藏
页码:44 / 54
页数:11
相关论文
共 50 条
  • [31] Identification of speech source coupling between sensors in reverberant noisy environments
    Cohen, I
    IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (07) : 613 - 616
  • [32] SPEECH REINFORCEMENT IN NOISY REVERBERANT ENVIRONMENTS USING A PERCEPTUAL DISTORTION MEASURE
    Crespo, Joao B.
    Hendriks, Richard C.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [33] RESTORATION OF INSTANTANEOUS AMPLITUDE AND PHASE OF SPEECH SIGNAL IN NOISY REVERBERANT ENVIRONMENTS
    Liu, Yang
    Nower, Naushin
    Yan, Yonghong
    Unoki, Masashi
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 879 - 883
  • [34] Neural Network-augmented Kalman Filtering for Robust Online Speech Dereverberation in Noisy Reverberant Environments
    Lemercier, Jean-Marie
    Thiemann, Joachim
    Koning, Raphael
    Gerkmann, Timo
    INTERSPEECH 2022, 2022, : 226 - 230
  • [35] Robust direction of arrival estimation for speech enhancement in noisy reverberant rooms
    Lotter, T
    Loellmann, HW
    Vary, P
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4186 - 4186
  • [36] MAXIMUM LIKELIHOOD PSD ESTIMATION FOR SPEECH ENHANCEMENT IN REVERBERANT AND NOISY CONDITIONS
    Kuklasinski, Adam
    Doclo, Simon
    Jensen, Jesper
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 599 - 603
  • [37] Multi-branch Learning for Noisy and Reverberant Monaural Speech Separation
    Ma, Chao
    Li, Dongmei
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1247 - 1251
  • [38] Enhancement of Noisy Reverberant Speech Using Polynomial Matrix Eigenvalue Decomposition
    Neo, Vincent W.
    Evers, Christine
    Naylor, Patrick A.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 3255 - 3266
  • [39] A robust speech enhancement method in noisy environments
    Abajaddi, Nesrine
    Mounir, Badia
    Elfahm, Youssef
    Farchi, Abdelmajid
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (09) : 973 - 983
  • [40] WHAM!: Extending Speech Separation to Noisy Environments
    Wichern, Gordon
    Antognini, Joe
    Flynn, Michael
    Zhu, Licheng Richard
    McQuinn, Emmett
    Crow, Dwight
    Manilow, Ethan
    Le Roux, Jonathan
    INTERSPEECH 2019, 2019, : 1368 - 1372