Wavelet based speech presence probability estimator for speech enhancement

被引:10
|
作者
Lun, Daniel Pak-Kong [1 ]
Shen, Tak-Wai [1 ]
Hsung, Tai-Chiu [1 ]
Ho, Dominic K. C. [2 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Ctr Signal Proc, Kowloon, Hong Kong, Peoples R China
[2] Univ Missouri, Dept Elect & Comp Engn, Columbia, MO USA
关键词
Wavelet denoising; Multitaper spectrum estimation; Speech enhancement; Speech presence probability; SPECTRAL AMPLITUDE ESTIMATOR; VOICE ACTIVITY DETECTION; NOISE; SHRINKAGE; FILTER;
D O I
10.1016/j.dsp.2012.06.011
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A reliable speech presence probability (SPP) estimator is important to many frequency domain speech enhancement algorithms. It is known that a good estimate of SPP can be obtained by having a smooth a-posteriori signal to noise ratio (SNR) function, which can be achieved by reducing the noise variance when estimating the speech power spectrum. Recently, the wavelet denoising with multitaper spectrum (MTS) estimation technique was suggested for such purpose. However, traditional approaches directly make use of the wavelet shrinkage denoiser which has not been fully optimized for denoising the MTS of noisy speech signals. In this paper, we firstly propose a two-stage wavelet denoising algorithm for estimating the speech power spectrum. First, we apply the wavelet transform to the periodogram of a noisy speech signal. Using the resulting wavelet coefficients, an oracle is developed to indicate the approximate locations of the noise floor in the periodogram. Second, we make use of the oracle developed in stage 1 to selectively remove the wavelet coefficients of the noise floor in the log MTS of the noisy speech. The wavelet coefficients that remained are then used to reconstruct a denoised MTS and in turn generate a smooth a-posteriori SNR function. To adapt to the enhanced a-posteriori SNR function, we further propose a new method to estimate the generalized likelihood ratio (GLR), which is an essential parameter for SPP estimation. Simulation results show that the new SPP estimator outperforms the traditional approaches and enables an improvement in both the quality and intelligibility of the enhanced speeches. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:1161 / 1173
页数:13
相关论文
共 50 条
  • [41] Masked noise probability-based speech enhancement
    Bian, ZZ
    Dai, QJ
    Chen, YP
    SECOND JOINT EMBS-BMES CONFERENCE 2002, VOLS 1-3, CONFERENCE PROCEEDINGS: BIOENGINEERING - INTEGRATIVE METHODOLOGIES, NEW TECHNOLOGIES, 2002, : 184 - 185
  • [42] Perceptually based speech enhancement using the weighted β-SA estimator
    Plourde, Eric
    Champagne, Benoit
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4193 - 4196
  • [43] NOISE POWER ESTIMATION BASED ON THE PROBABILITY OF SPEECH PRESENCE
    Gerkmann, Timo
    Hendriks, Richard C.
    2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 145 - 148
  • [44] Acoustic echo suppression based on speech presence probability
    Tong, Ying
    Gu, Yaping
    2016 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2016, : 35 - 38
  • [45] Speech enhancement based on modified Mel masking model and speech absence probability in whispers
    Tao, Zhi
    Zhao, Heming
    Wu, Di
    Chen, Daqing
    Zhang, Xiaojun
    Shengxue Xuebao/Acta Acustica, 2009, 34 (04): : 370 - 377
  • [46] Distributed Speech Presence Probability Estimator in Fully Connected Wireless Acoustic Sensor Networks
    Ranjbaryan, Raziyeh
    Abutalebi, Hamid Reza
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (12) : 6121 - 6141
  • [47] Distributed Speech Presence Probability Estimator in Fully Connected Wireless Acoustic Sensor Networks
    Raziyeh Ranjbaryan
    Hamid Reza Abutalebi
    Circuits, Systems, and Signal Processing, 2020, 39 : 6121 - 6141
  • [48] Speech enhancement based on stationary bionic wavelet transform and maximum a posterior estimator of magnitude-squared spectrum
    Mourad T.
    International Journal of Speech Technology, 2017, 20 (1) : 75 - 88
  • [49] A Wavelet Fusion Method for Speech Enhancement
    Xia, Bing-yin
    Bao, Chang-chun
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 473 - 476
  • [50] An improved estimation of a priori speech absence probability for speech enhancement: In perspective of speech perception
    Choi, MS
    Kang, HG
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1117 - 1120