Supervised Single-Channel Speech Dereverberation and Denoising Using a Two-Stage Processing

Times cited: 0
Authors
Zhang, Long [1]
Chen, Jiaxu [1]
Luo, You [1]
Fu, Jiafei [1]
Ye, Zhongfu [1]
Affiliations
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Natl Engn Lab Speech & Language Informat Proc, Hefei, Anhui, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
speech dereverberation and denoising; room impulse response; non-negative matrix factorization; two-stage processing; ENHANCEMENT;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In many acoustic conditions, a speech signal recorded with a single channel may be severely degraded by reverberation and noise, which reduces speech quality and intelligibility. This paper proposes a novel two-stage processing scheme for single-channel speech dereverberation and denoising that enhances the spectrum of the noisy reverberant signal. Like previous methods, the proposed method uses a non-negative approximation of the convolutive transfer function (N-CTF) to simultaneously estimate the magnitude spectrograms of the speech signal and the room impulse response (RIR). The novelty of the proposed algorithm lies in decomposing the RIRs into two parts to build a two-stage processing scheme for enhancing speech in noisy environments. In the first stage, the algorithm iteratively estimates a less reverberant speech signal and a short RIR; in the second stage, it iteratively estimates the clean speech signal and another short RIR. Both stages include denoising steps. By decomposing the long RIRs into two parts, the proposed algorithm enhances speech more effectively and requires less computation time. Additionally, the optimal estimator is derived based on temporal stacking to exploit speech temporal dynamics. Experiments on two simulated RIRs compare the proposed method with a state-of-the-art method, and the results show that the proposed method significantly improves the quality and intelligibility of the enhanced speech.
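The two-stage idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a squared Euclidean cost with standard multiplicative updates, and it omits the supervised NMF speech and noise models, the denoising steps, and the temporal stacking that the abstract mentions. The function names (`nctf_forward`, `nctf_stage`, `two_stage`) and the RIR lengths `L1` and `L2` are hypothetical, chosen only for illustration.

```python
import numpy as np


def nctf_forward(H, S):
    """N-CTF model: Y_hat[k, n] = sum_tau H[k, tau] * S[k, n - tau].

    H: (K, L) non-negative RIR magnitudes; S: (K, N) non-negative speech
    magnitudes. Frames before n = 0 are treated as zero; output is (K, N).
    """
    K, N = S.shape
    Y_hat = np.zeros((K, N))
    for tau in range(H.shape[1]):
        Y_hat[:, tau:] += H[:, tau:tau + 1] * S[:, :N - tau]
    return Y_hat


def nctf_stage(Y, L, n_iter=50, eps=1e-12):
    """One stage: estimate a length-L RIR magnitude H and a less reverberant
    spectrogram S from Y (assumed cost: squared Euclidean distance)."""
    K, N = Y.shape
    S = Y + eps  # initialise the speech estimate with the observation
    H = np.tile(np.exp(-np.arange(L, dtype=float)), (K, 1))  # decaying init
    costs = []
    for _ in range(n_iter):
        Y_hat = nctf_forward(H, S)
        costs.append(0.5 * np.sum((Y - Y_hat) ** 2))
        # Multiplicative update of S: correlate H with Y and with Y_hat.
        num = np.zeros_like(S)
        den = np.zeros_like(S)
        for tau in range(L):
            num[:, :N - tau] += H[:, tau:tau + 1] * Y[:, tau:]
            den[:, :N - tau] += H[:, tau:tau + 1] * Y_hat[:, tau:]
        S = S * num / (den + eps)
        # Multiplicative update of H, using the refreshed model output.
        Y_hat = nctf_forward(H, S)
        for tau in range(L):
            num_h = np.sum(Y[:, tau:] * S[:, :N - tau], axis=1)
            den_h = np.sum(Y_hat[:, tau:] * S[:, :N - tau], axis=1)
            H[:, tau] *= num_h / (den_h + eps)
    costs.append(0.5 * np.sum((Y - nctf_forward(H, S)) ** 2))
    return S, H, costs


def two_stage(Y, L1, L2, n_iter=50):
    """Stage 1 removes a short RIR from Y; stage 2 removes another short RIR
    from the stage-1 output. Denoising is deliberately left out of this sketch."""
    X1, H1, costs1 = nctf_stage(Y, L1, n_iter)
    S_hat, H2, costs2 = nctf_stage(X1, L2, n_iter)
    return S_hat, H1, H2, costs1, costs2
```

In this sketch each stage only has to fit `L1` or `L2` taps per frequency bin rather than one long filter, which is one plausible reading of the time saving claimed in the abstract. The scale ambiguity between `H` and `S` is left unresolved here.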
Pages: 818 - 822
Number of pages: 5