Speech Enhancement Using a Two-Stage Network for an Efficient Boosting Strategy

Cited by: 14

Authors:

Kim, Juntae [1]

Hahn, Minsoo [1]

Affiliations:

[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon 34141, South Korea

Source:

IEEE SIGNAL PROCESSING LETTERS | 2019 / Vol. 26 / No. 05

Funding:

National Research Foundation of Singapore;

Keywords:

Speech enhancement; two-stage network; NOISE;

DOI:

10.1109/LSP.2019.2905660

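Chinese Library Classification number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code: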

0808 ; 0809 ;

Abstract:

A novel neural network architecture, called the two-stage network (TSN), with a multi-objective learning (MOL) method for an efficient boosting strategy (BS), is proposed for speech enhancement. BS is an ensemble method that uses multiple base predictions (MBPs) to obtain a better final prediction. Because MBPs are required, the computational cost and model size of BS-based methods are greater than those of a single model. To overcome this, TSN first obtains MBPs from a single deep neural network. Then, to obtain a better final prediction, the convolution layers of TSN aggregate not only the MBPs but also auxiliary information such as contextual information, while adaptively filtering out unnecessary information, e.g., poor base predictions. In the training phase, MOL enables all stages of TSN to learn jointly while allowing the TSN framework to embed a BS. Our experimental results confirm that the embedded BS leads TSN to outperform other baseline methods with a reasonably low computational cost and model size.
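The two ideas in the abstract, aggregating several base predictions while down-weighting poor ones and training all stages with a joint objective, can be illustrated with a toy sketch. This is not the authors' implementation: the record does not give the network layers or the loss formula, so the softmax-weighted average below is only a stand-in for the convolutional aggregation stage, and the names `aggregate`, `multi_objective_loss`, `scores`, and `alpha` are hypothetical.

```python
import math


def mse(pred, target):
    """Mean squared error between two equal-length sequences."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(target)


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate(base_preds, scores):
    """Stand-in for stage 2 (assumption): a softmax-weighted average of the
    base predictions, so a low score suppresses a poor base prediction."""
    weights = softmax(scores)
    length = len(base_preds[0])
    return [
        sum(w * bp[i] for w, bp in zip(weights, base_preds))
        for i in range(length)
    ]


def multi_objective_loss(base_preds, final_pred, target, alpha=0.5):
    """Hypothetical joint objective: the mean loss of the base predictions
    plus the loss of the final prediction, mixed by an assumed weight alpha."""
    base_term = sum(mse(bp, target) for bp in base_preds) / len(base_preds)
    return alpha * base_term + (1.0 - alpha) * mse(final_pred, target)


# Toy data: two reasonable base predictions and one poor one.
target = [1.0, 2.0, 3.0]
base_preds = [[1.1, 2.1, 3.1], [0.9, 1.9, 2.9], [3.0, 0.0, 5.0]]
scores = [0.0, 0.0, -10.0]  # the poor prediction receives a very low score

final_pred = aggregate(base_preds, scores)
loss = multi_objective_loss(base_preds, final_pred, target)
```

In this toy setting the aggregated prediction has a lower error than any single base prediction because the poor one is almost entirely filtered out. In the paper itself the filtering is learned by convolution layers rather than supplied as fixed scores.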


Pages: 770-774

Number of pages: 5

Total: 50 items

[1] TWO-STAGE SPEECH ENHANCEMENT USING GATED CONVOLUTIONS
Thieling, Lars
Jax, Peter
[J]. 2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
[2] An efficient Bayesian network structure learning algorithm using the strategy of two-stage searches
Guo, Huiping
Li, Hongru
[J]. INTELLIGENT DATA ANALYSIS, 2020, 24 (05) : 1087 - 1106
[3] A two-stage algorithm for enhancement of reverberant speech
Wu, MY
Wang, D
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1085 - 1088
[4] A two-stage frequency-time dilated dense network for speech enhancement
Huang, Xiangdong
Chen, Honghong
Lu, Wei
[J]. APPLIED ACOUSTICS, 2022, 201
[5] A two-stage complex network using cycle-consistent generative adversarial networks for speech enhancement
Yu, Guochen
Wang, Yutian
Wang, Hui
Zhang, Qin
Zheng, Chengshi
[J]. SPEECH COMMUNICATION, 2021, 134 : 42 - 54
[6] A TWO-STAGE ALGORITHM FOR NOISY AND REVERBERANT SPEECH ENHANCEMENT
Zhao, Yan
Wang, Zhong-Qiu
Wang, DeLiang
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5580 - 5584
[7] TWO-STAGE SPEECH ENHANCEMENT WITH MANIPULATION OF THE CEPSTRAL EXCITATION
Elshamy, Samy
Madhu, Nilesh
Tirry, Wouter
Fingscheidt, Tim
[J]. 2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 106 - 110
[8] TSTNN: TWO-STAGE TRANSFORMER BASED NEURAL NETWORK FOR SPEECH ENHANCEMENT IN THE TIME DOMAIN
Wang, Kai
He, Bengbeng
Zhu, Wei-Ping
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7098 - 7102
[9] Investigations on the Optimal Estimation of Speech Envelopes for the Two-Stage Speech Enhancement
Song, Yanjue
Madhu, Nilesh
[J]. SENSORS, 2023, 23 (14)
[10] A two-stage method for single-channel speech enhancement
Hamid, ME
Fukabayashi, T
[J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2006, E89A (04) : 1058 - 1068
