SESNet: A Speech Enhancement and Separation Network in Noisy Reverberant Environments

被引:0
|
作者
Wang, Liusong [1 ,2 ]
Gao, Yuan [1 ,2 ]
Cao, Kaimin [1 ,2 ]
Hu, Ying [1 ,2 ]
机构
[1] Xinjiang Univ, Sch Comp Sci & Technol, Urumqi, Peoples R China
[2] Key Lab Signal Detect & Proc Xinjiang, Urumqi, Peoples R China
关键词
Speech enhancement; Speech separation; Noisy reverberant environment; Former block;
D O I
10.1007/978-981-96-1045-7_4
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech enhancement and separation in noisy reverberant environments are very challenging tasks. In this paper, we propose a speech enhancement and separation network, SESNet, for speech enhancement or speech separation in noisy reverberant environments, which is a multi-scale encoder-decoder architecture including a global-local feature extractor (GLFE). We also explored four kinds of Former blocks to be equipped in GLFE. We evaluate the performance of speech enhancement and speech separation on the VoiceBank+DEMAND and the WHAMR! datasets. The experimental results show that the SESNet has excellent performance for single- and multi-channel speech enhancement, and single-channel multi-speaker speech separation, keeping with a small model size.
引用
收藏
页码:44 / 54
页数:11
相关论文
共 50 条
  • [21] Humanoid separation of speech sources in reverberant environments
    Schulz, Sylvia
    Herfet, Thorsten
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 377 - 382
  • [22] Speech Enhancement and Recognition of Compressed Speech Signal in Noisy Reverberant Conditions
    Suman, Maloji
    Khan, Habibulla
    Latha, M. Madhavi
    Kumari, Devarakonda Aruna
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 379 - +
  • [23] Effects of urgent speech and preceding sounds on speech intelligibility in noisy and reverberant environments
    Hodoshima, Nao
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1696 - 1699
  • [24] A TWO-STAGE ALGORITHM FOR NOISY AND REVERBERANT SPEECH ENHANCEMENT
    Zhao, Yan
    Wang, Zhong-Qiu
    Wang, DeLiang
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5580 - 5584
  • [25] Maximum likelihood approach to speech enhancement for noisy reverberant signals
    Yoshioka, Takuya
    Nakatani, Tomohiro
    Hikichi, Takafumi
    Miyoshi, Masato
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4585 - 4588
  • [26] WHAMR!: NOISY AND REVERBERANT SINGLE-CHANNEL SPEECH SEPARATION
    Maciejewski, Matthew
    Wichern, Gordon
    McQuinn, Emmett
    Le Roux, Jonathan
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 696 - 700
  • [27] Speech Synthesis enhancement in noisy environments
    Bonardo, Davide
    Zovato, Enrico
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 789 - 792
  • [28] Speech Enhancement with Wide Residual Networks in Reverberant Environments
    Llombart, Jorge
    Ribas, Dayana
    Miguel, Antonio
    Vicente, Luis
    Ortega, Alfonso
    Lleida, Eduardo
    INTERSPEECH 2019, 2019, : 1811 - 1815
  • [29] PEVD-BASED SPEECH ENHANCEMENT IN REVERBERANT ENVIRONMENTS
    Neo, Vincent W.
    Evers, Christine
    Naylor, Patrick A.
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 186 - 190
  • [30] Speech enhancement applied to speech recognition in noisy environments
    Xu, Y.F., 2001, Press of Tsinghua University (41):