Supervised Single-Channel Speech Dereverberation and Denoising Using a Two-Stage Processing

被引：0

作者：

Zhang, Long ^{[1
]}

Ehen, Jiaxu ^{[1
]}

Luo, You ^{[1
]}

Fu, Jiafei ^{[1
]}

Ye, Zhongfu ^{[1
]}

机构：

[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Natl Engn Lab Speech & Language Informat Proc, Hefei, Anhui, Peoples R China

来源：

2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017) | 2017年

基金：

中国国家自然科学基金;

关键词：

speech dereverberation and denoising; room impulse response; non-negative matrix jactorization; two-stage processing; ENHANCEMENT;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In many acoustic conditions, a single-channel recorded speech signal may be severely affected by reverberation and noise, leading to a reduced speech quality and intelligibility. This paper focuses on proposing a novel two-stage processing scheme for single-channel speech dereverberation and denoising to enhance the spectrum of the noisy reverberant signal. Similar as previous methods, the proposed method uses a non-negative approximation of the convolutive transfer function (N-CTF) to simultaneously estimate the magnitude spectrograms of the speech signal and the room impulse response (RIR). What's the novelty of proposed algorithm is decomposing the RIRs into two parts to build a two-stage processing scheme for enhancing speech from the noisy environments. The proposed algorithm is iteratively updated to estimate a less reverberant speech signal and a short RIR at first stage, then the clean speech signal and another short RIR are estimated by iteratively updating at the second stage. There are always denosing process steps within both stages. The advantages of our proposed algorithm are more capable to enhance the speech and more time-saving by decomposing the long RIRs into two parts. Additionally, the optimal estimator is derived based on temporal stacking to utilize speech temporal dynamics. Experiments are performed on two simulated RIRs to compare the performances of the proposed method with a state-of-the-art method and the results show that the proposed method has significantly improved the enhanced speech quality and intelligibility.

引用

页码：818 / 822

页数：5

共 50 条

[1] Supervised single-channel speech dereverberation and denoising using a two-stage model based sparse representation
Zhang Long
Xu Xu
Chen Huang
Chen Jiaxu
Ye Zhongfu
[J]. SPEECH COMMUNICATION, 2018, 97 : 1 - 8
[2] Two-Stage Temporal Processing for Single-Channel Speech Enhancement
Samui, Sunzan
Chakrabarti, Indrajit
Ghosh, Soumya Kanti
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3723 - 3727
[3] A two-stage method for single-channel speech enhancement
Hamid, ME
Fukabayashi, T
[J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2006, E89A (04) : 1058 - 1068
[4] Single-Channel Speech Dereverberation in Acoustical Environments
Joorabchi, Marjan
Ghorshi, Seyed
Sarafnia, Ali
[J]. 2014 56TH INTERNATIONAL SYMPOSIUM ELMAR (ELMAR), 2014, : 211 - 214
[5] A Novel Scheme for Single-Channel Speech Dereverberation
Kilis, Nikolaos
Mitianoudis, Nikolaos
[J]. ACOUSTICS, 2019, 1 (03): : 711 - 725
[6] SINGLE CHANNEL JOINT SPEECH DEREVERBERATION AND DENOISING USING DEEP PRIORS
Raikar, Aditya
Basu, Sourya
Hegde, Rajesh M.
[J]. 2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 216 - 220
[7] Two-Stage Single-Channel Speech Enhancement with Multi-Frame Filtering
Lin, Shaoxiong
Zhang, Wangyou
Qian, Yanmin
[J]. APPLIED SCIENCES-BASEL, 2023, 13 (08):
[8] A New Two-Stage Method for Single-Microphone Speech Dereverberation
Baghaki, Ali
Ahmad, M. Omair
Swamy, M. N. S.
[J]. 2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 778 - 781
[9] Single-channel Speech Dereverberation via Generative Adversarial Training
Li, Chenxing
Wang, Tieqiang
Xu, Shuang
Xu, Bo
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1309 - 1313
[10] Perceptual Improvement of a Two-Stage Algorithm for Speech Dereverberation
Prego, Thiago de M.
de Lima, Amaro A.
Netto, Sergio L.
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 216 - 219

← 1 2 3 4 5 →