Noise estimation based on time-frequency correlation for speech enhancement

被引:5
|
作者
Yuan, Wenhao [1 ]
Lin, Jiajun [1 ]
An, Wei [1 ]
Wang, Yu [1 ]
Chen, Ning [1 ]
机构
[1] E China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
Noise estimation; Speech enhancement; Minimum search; Improved Minima Controlled Recursive; Averaging; ENVIRONMENTS; RECOGNITION;
D O I
10.1016/j.apacoust.2012.11.007
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
As a fundamental part of speech enhancement, noise estimation is particularly challenging in highly nonstationary noise environments. In this work, we propose an effective algorithm on the basis of the "Improved Minima Controlled Recursive Averaging (IMCRA)" with the objective to improve the performance of noise estimation. The main contributions of this work are: (i) in the algorithm, a rough decision about speech presence is proposed by calculating the autocorrelation and cross-channel correlation of the T-F (Time-Frequency) units; (ii) with this decision, we refine the smoothing parameters for the smoothing of noisy power spectrum and the recursive averaging in noise spectrum estimation as well as the weighting factor for the a priori SNR (Signal to Noise Ratio) estimation in the IMCRA; (iii) we improve the search of local minima during spectral bursts by adding a minimum search with a shorter window. Extensive experiments are carried out to evaluate the performance of our proposed algorithm. The experimental results illustrate that, compared with the IMCRA, the proposed approach significantly improves the accuracy of noise spectrum estimation and the quality of enhanced speech in the typical noise situations. (c) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:770 / 781
页数:12
相关论文
共 50 条
  • [41] Speech intelligibility in background noise with ideal binary time-frequency masking
    Wang, DeLiang
    Kjems, Ulrik
    Pedersen, Michael S.
    Boldt, Jesper B.
    Lunner, Thomas
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (04): : 2336 - 2347
  • [42] A time-frequency fusion model for multi-channel speech enhancement
    Zeng, Xiao
    Xu, Shiyun
    Wang, Mingjiang
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01):
  • [43] Continuous Noise Estimation Using Time-Frequency ECG Representation
    Augustyniak, Piotr
    [J]. 2011 COMPUTING IN CARDIOLOGY, 2011, 38 : 133 - 136
  • [44] Deep Learning Assisted Time-Frequency Processing for Speech Enhancement on Drones
    Wang, Lin
    Cavallaro, Andrea
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2021, 5 (06): : 871 - 881
  • [45] Time-Frequency Domain Impulsive Noise Detection System in Speech Signal
    Choi, Min-Seok
    Shin, Ho Seon
    Hwang, Young-Soo
    Kang, Hong-Goo
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2011, 30 (02): : 73 - 79
  • [46] Single channel speech enhancement via time-frequency dictionary learning
    HUANG Jianjun
    ZHANG Xiongwei
    ZHANG Yafei
    ZOU Xia
    [J]. Chinese Journal of Acoustics, 2013, 32 (01) : 90 - 102
  • [47] Residual Unet with Attention Mechanism for Time-Frequency Domain Speech Enhancement
    Chen, Hanyu
    Peng, Xiwei
    Jiang, Qiqi
    Guo, Yujie
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7007 - 7011
  • [48] Perceptual effects of noise reduction by time-frequency masking of noisy speech
    Brons, Inge
    Houben, Rolph
    Dreschler, Wouter A.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (04): : 2690 - 2699
  • [49] Review of Time-Frequency Masking Approach for Improving Speech Intelligibility in Noise
    Kim, Gibak
    [J]. IETE TECHNICAL REVIEW, 2022, 39 (03) : 623 - 634
  • [50] Time-frequency thresholding: A new algorithm in wavelet package speech enhancement
    Wang, Gang
    Xu, Yaohua
    Li, Xiaolin
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 4, PROCEEDINGS, 2008, : 327 - +