Wavelet based speech presence probability estimator for speech enhancement

被引:10
|
作者
Lun, Daniel Pak-Kong [1 ]
Shen, Tak-Wai [1 ]
Hsung, Tai-Chiu [1 ]
Ho, Dominic K. C. [2 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Ctr Signal Proc, Kowloon, Hong Kong, Peoples R China
[2] Univ Missouri, Dept Elect & Comp Engn, Columbia, MO USA
关键词
Wavelet denoising; Multitaper spectrum estimation; Speech enhancement; Speech presence probability; SPECTRAL AMPLITUDE ESTIMATOR; VOICE ACTIVITY DETECTION; NOISE; SHRINKAGE; FILTER;
D O I
10.1016/j.dsp.2012.06.011
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A reliable speech presence probability (SPP) estimator is important to many frequency domain speech enhancement algorithms. It is known that a good estimate of SPP can be obtained by having a smooth a-posteriori signal to noise ratio (SNR) function, which can be achieved by reducing the noise variance when estimating the speech power spectrum. Recently, the wavelet denoising with multitaper spectrum (MTS) estimation technique was suggested for such purpose. However, traditional approaches directly make use of the wavelet shrinkage denoiser which has not been fully optimized for denoising the MTS of noisy speech signals. In this paper, we firstly propose a two-stage wavelet denoising algorithm for estimating the speech power spectrum. First, we apply the wavelet transform to the periodogram of a noisy speech signal. Using the resulting wavelet coefficients, an oracle is developed to indicate the approximate locations of the noise floor in the periodogram. Second, we make use of the oracle developed in stage 1 to selectively remove the wavelet coefficients of the noise floor in the log MTS of the noisy speech. The wavelet coefficients that remained are then used to reconstruct a denoised MTS and in turn generate a smooth a-posteriori SNR function. To adapt to the enhanced a-posteriori SNR function, we further propose a new method to estimate the generalized likelihood ratio (GLR), which is an essential parameter for SPP estimation. Simulation results show that the new SPP estimator outperforms the traditional approaches and enables an improvement in both the quality and intelligibility of the enhanced speeches. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:1161 / 1173
页数:13
相关论文
共 50 条
  • [21] Enhancement of Noisy Speech using Sub-band Harmonic Regeneration and Speech Presence Uncertainty Estimator
    Kumar, Ravi
    Subbaiah, P. V.
    2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 456 - 460
  • [22] Minimum mean square error estimator for speech enhancement in additive noise assuming Weibull speech priors and speech presence uncertainty
    Bahrami, Mojtaba
    Faraji, Neda
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (01) : 97 - 108
  • [23] Minimum mean square error estimator for speech enhancement in additive noise assuming Weibull speech priors and speech presence uncertainty
    Mojtaba Bahrami
    Neda Faraji
    International Journal of Speech Technology, 2021, 24 : 97 - 108
  • [24] Speech Enhancement Based on Teacher-Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition
    Tu, Yan-Hui
    Du, Jun
    Lee, Chin-Hui
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2080 - 2091
  • [25] SPEECH PRESENCE PROBABILITY ESTIMATION BASED ON INTEGRATED TIME-FREQUENCY MINIMUM TRACKING FOR SPEECH ENHANCEMENT IN ADVERSE ENVIRONMENTS
    Fu, Zhong-Hua
    Wang, Jhing-Fa
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4258 - 4261
  • [26] DNN-BASED SPEECH PRESENCE PROBABILITY ESTIMATION FORMULTI-FRAME SINGLE-MICROPHONE SPEECH ENHANCEMENT
    Tammen, Marvin
    Fischer, Doerte
    Meyer, Bernd T.
    Doclo, Simon
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 191 - 195
  • [27] A Generalized Subspace Approach for Multichannel Speech Enhancement Using Machine Learning-Based Speech Presence Probability Estimation
    Ke, Yuxuan
    Hu, Yi
    Li, Jian
    Zheng, Chengshi
    Li, Xiaodong
    146TH AES CONVENTION, 2019,
  • [28] An improved wavelet-based speech enhancement by using speech signal features
    Ayat, Saeed
    Manzuri-Shalmani, M. T.
    Dianat, Roohollah
    COMPUTERS & ELECTRICAL ENGINEERING, 2006, 32 (06) : 411 - 425
  • [29] A SPEECH PRESENCE MICROPHONE ARRAY BEAMFORMER USING MODEL BASED SPEECH PRESENCE PROBABILITY ESTIMATION
    Yu, Tao
    Hansen, John H. L.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 213 - 216
  • [30] A SPEECH SPECTRAL ESTIMATOR USING ADAPTIVE SPEECH PROBABILITY DENSITY FUNCTION
    Kawamura, Arata
    Thanhikam, W.
    Iiguni, Youji
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1549 - 1552