Speech Enhancement Using a Two-Stage Network for an Efficient Boosting Strategy

Cited by: 14
Authors
Kim, Juntae [1]
Hahn, Minsoo [1]
Affiliations
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon 34141, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Speech enhancement; two-stage network; NOISE;
DOI
10.1109/LSP.2019.2905660
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808; 0809;
Abstract
A novel neural network architecture, called the two-stage network (TSN), with a multi-objective learning (MOL) method for an efficient boosting strategy (BS) is proposed for speech enhancement. BS is an ensemble method that uses multiple base predictions (MBPs) to obtain a better final prediction. Because MBPs are required, the computational cost and model size of BS-based methods are greater than those of a single model. To overcome this, TSN first obtains the MBPs from a single deep neural network. Then, to obtain a better final prediction, the convolution layers of TSN aggregate not only the MBPs but also auxiliary information such as contextual information, while adaptively filtering out unnecessary information, e.g., poor base predictions. In the training phase, MOL enables all stages of TSN to learn jointly while allowing the TSN framework to embed a BS. Our experimental results confirm that the embedded BS leads TSN to outperform other baseline methods at a reasonably low computational cost and model size.
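The two-stage idea described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the layer sizes, the ReLU layer, the fixed linear kernel in stage 2 (which does not model the adaptive filtering mentioned in the abstract), the loss weight `alpha`, and the names `stage1`, `stage2`, and `mol_loss` are all assumptions made for illustration. Stage 1 produces several base predictions from one shared network, stage 2 aggregates them with neighbouring frames and the noisy input through a small convolution over time, and the loss sums a final-prediction term and a base-prediction term so that both stages would be trained jointly.

```python
import random

random.seed(0)
# Illustrative sizes only (assumptions, not taken from the paper).
N_FRAMES, N_FEAT, N_HIDDEN, N_BASE, CONTEXT = 6, 4, 5, 3, 1

def rand_matrix(rows, cols):
    """Small random weight matrix as a list of rows."""
    return [[random.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

def matmul(a, b):
    """Plain matrix product of two list-of-rows matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def mse(pred, target):
    """Mean squared error over all frames and features."""
    diffs = [(p - t) ** 2 for pr, tr in zip(pred, target) for p, t in zip(pr, tr)]
    return sum(diffs) / len(diffs)

W_shared = rand_matrix(N_FEAT, N_HIDDEN)                        # shared hidden layer
W_heads = [rand_matrix(N_HIDDEN, N_FEAT) for _ in range(N_BASE)]  # one head per base prediction
KERNEL = rand_matrix(N_BASE + 1, 2 * CONTEXT + 1)               # (channels, time taps)

def stage1(noisy):
    """Single network with N_BASE output heads: returns N_BASE base predictions."""
    hidden = [[max(v, 0.0) for v in row] for row in matmul(noisy, W_shared)]
    return [matmul(hidden, head) for head in W_heads]

def stage2(base, noisy):
    """Convolve over time across the base predictions plus the noisy input."""
    channels = base + [noisy]
    n_frames = len(noisy)
    final = [[0.0] * N_FEAT for _ in range(n_frames)]
    for c, channel in enumerate(channels):
        for t in range(n_frames):
            for k in range(2 * CONTEXT + 1):
                src = t + k - CONTEXT
                if 0 <= src < n_frames:  # frames outside the signal count as zero
                    for f in range(N_FEAT):
                        final[t][f] += KERNEL[c][k] * channel[src][f]
    return final

def mol_loss(noisy, clean, alpha=0.5):
    """Multi-objective loss: final-prediction MSE plus weighted mean base-prediction MSE."""
    base = stage1(noisy)
    final = stage2(base, noisy)
    base_term = sum(mse(b, clean) for b in base) / len(base)
    return mse(final, clean) + alpha * base_term

# Random stand-ins for noisy and clean feature frames.
noisy = [[random.gauss(0.0, 1.0) for _ in range(N_FEAT)] for _ in range(N_FRAMES)]
clean = [[random.gauss(0.0, 1.0) for _ in range(N_FEAT)] for _ in range(N_FRAMES)]
base = stage1(noisy)
final = stage2(base, noisy)
loss = mol_loss(noisy, clean)
```

Because the base predictions share one network, adding another head costs only one extra output matrix rather than a whole additional model, which is the efficiency argument the abstract makes for embedding the boosting strategy.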
Pages: 770-774
Number of pages: 5
Related papers
50 in total
  • [1] TWO-STAGE SPEECH ENHANCEMENT USING GATED CONVOLUTIONS
    Thieling, Lars
    Jax, Peter
    [J]. 2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [2] An efficient Bayesian network structure learning algorithm using the strategy of two-stage searches
    Guo, Huiping
    Li, Hongru
    [J]. INTELLIGENT DATA ANALYSIS, 2020, 24 (05) : 1087 - 1106
  • [3] A two-stage algorithm for enhancement of reverberant speech
    Wu, MY
    Wang, D
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1085 - 1088
  • [4] A two-stage frequency-time dilated dense network for speech enhancement
    Huang, Xiangdong
    Chen, Honghong
    Lu, Wei
    [J]. APPLIED ACOUSTICS, 2022, 201
  • [5] A two-stage complex network using cycle-consistent generative adversarial networks for speech enhancement
    Yu, Guochen
    Wang, Yutian
    Wang, Hui
    Zhang, Qin
    Zheng, Chengshi
    [J]. SPEECH COMMUNICATION, 2021, 134 : 42 - 54
  • [6] A TWO-STAGE ALGORITHM FOR NOISY AND REVERBERANT SPEECH ENHANCEMENT
    Zhao, Yan
    Wang, Zhong-Qiu
    Wang, DeLiang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5580 - 5584
  • [7] TWO-STAGE SPEECH ENHANCEMENT WITH MANIPULATION OF THE CEPSTRAL EXCITATION
    Elshamy, Samy
    Madhu, Nilesh
    Tirry, Wouter
    Fingscheidt, Tim
    [J]. 2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 106 - 110
  • [8] TSTNN: TWO-STAGE TRANSFORMER BASED NEURAL NETWORK FOR SPEECH ENHANCEMENT IN THE TIME DOMAIN
    Wang, Kai
    He, Bengbeng
    Zhu, Wei-Ping
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7098 - 7102
  • [9] Investigations on the Optimal Estimation of Speech Envelopes for the Two-Stage Speech Enhancement
    Song, Yanjue
    Madhu, Nilesh
    [J]. SENSORS, 2023, 23 (14)
  • [10] A two-stage method for single-channel speech enhancement
    Hamid, ME
    Fukabayashi, T
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2006, E89A (04) : 1058 - 1068