Wavelet based speech presence probability estimator for speech enhancement

Cited by: 10
Authors
Lun, Daniel Pak-Kong [1]
Shen, Tak-Wai [1]
Hsung, Tai-Chiu [1]
Ho, Dominic K. C. [2]
Affiliations
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Ctr Signal Proc, Kowloon, Hong Kong, Peoples R China
[2] Univ Missouri, Dept Elect & Comp Engn, Columbia, MO USA
Keywords
Wavelet denoising; Multitaper spectrum estimation; Speech enhancement; Speech presence probability; SPECTRAL AMPLITUDE ESTIMATOR; VOICE ACTIVITY DETECTION; NOISE; SHRINKAGE; FILTER;
DOI
10.1016/j.dsp.2012.06.011
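Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract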
A reliable speech presence probability (SPP) estimator is important to many frequency-domain speech enhancement algorithms. It is known that a good estimate of SPP can be obtained from a smooth a-posteriori signal-to-noise ratio (SNR) function, which can be achieved by reducing the noise variance when estimating the speech power spectrum. Recently, wavelet denoising combined with multitaper spectrum (MTS) estimation was suggested for this purpose. However, traditional approaches directly use the wavelet shrinkage denoiser, which has not been fully optimized for denoising the MTS of noisy speech signals. In this paper, we first propose a two-stage wavelet denoising algorithm for estimating the speech power spectrum. In the first stage, we apply the wavelet transform to the periodogram of a noisy speech signal. Using the resulting wavelet coefficients, an oracle is developed to indicate the approximate locations of the noise floor in the periodogram. In the second stage, we use this oracle to selectively remove the wavelet coefficients of the noise floor in the log MTS of the noisy speech. The remaining wavelet coefficients are then used to reconstruct a denoised MTS, which in turn yields a smooth a-posteriori SNR function. To adapt to the enhanced a-posteriori SNR function, we further propose a new method to estimate the generalized likelihood ratio (GLR), which is an essential parameter for SPP estimation. Simulation results show that the new SPP estimator outperforms the traditional approaches and improves both the quality and the intelligibility of the enhanced speech. (C) 2012 Elsevier Inc. All rights reserved.
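The two-stage idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the wavelet, the taper family, or the oracle rule, so this sketch assumes a one-level Haar transform, sine tapers, and a hypothetical oracle that marks a coefficient as noise floor when the corresponding periodogram detail coefficient falls below `scale` times the median magnitude. All function names and the `scale` and `k` parameters are illustrative.

```python
import numpy as np

def sine_tapers(n, k):
    """Return k orthonormal sine tapers of length n, shape (k, n)."""
    t = np.arange(1, n + 1)
    orders = np.arange(1, k + 1)[:, None]
    return np.sqrt(2.0 / (n + 1)) * np.sin(np.pi * orders * t / (n + 1))

def multitaper_spectrum(frame, k=5):
    """Average the k tapered power spectra of one frame (assumed taper choice)."""
    tapers = sine_tapers(len(frame), k)
    spectra = np.abs(np.fft.rfft(tapers * frame, axis=1)) ** 2
    return spectra.mean(axis=0)

def haar_dwt(x):
    """One-level orthonormal Haar transform of an even-length array."""
    even, odd = x[0::2], x[1::2]
    return (even + odd) / np.sqrt(2), (even - odd) / np.sqrt(2)

def haar_idwt(approx, detail):
    """Inverse of haar_dwt."""
    x = np.empty(2 * len(approx))
    x[0::2] = (approx + detail) / np.sqrt(2)
    x[1::2] = (approx - detail) / np.sqrt(2)
    return x

def two_stage_denoise(frame, k=5, scale=1.0, eps=1e-12):
    """Sketch of a two-stage wavelet denoiser for the power spectrum."""
    n_bins = len(frame) // 2  # drop the Nyquist bin so the length stays even
    # Stage 1: wavelet-transform the periodogram and build the oracle mask.
    periodogram = np.abs(np.fft.rfft(frame))[:n_bins] ** 2 / len(frame)
    _, d_per = haar_dwt(periodogram)
    # Hypothetical oracle: small detail coefficients indicate the noise floor.
    noise_floor = np.abs(d_per) < scale * np.median(np.abs(d_per))
    # Stage 2: remove the flagged coefficients from the log multitaper spectrum.
    log_mts = np.log(multitaper_spectrum(frame, k)[:n_bins] + eps)
    approx, detail = haar_dwt(log_mts)
    detail = np.where(noise_floor, 0.0, detail)
    return np.exp(haar_idwt(approx, detail))

# Usage: a noisy sinusoid frame; noise_psd is an assumed known noise level.
rng = np.random.default_rng(0)
n = 512
frame = np.sin(2 * np.pi * 0.1 * np.arange(n)) + 0.5 * rng.standard_normal(n)
denoised = two_stage_denoise(frame)
noise_psd = 0.25
posteriori_snr = denoised / noise_psd  # smoothed a-posteriori SNR per bin
```

Setting `scale=0` disables the oracle and returns the plain multitaper spectrum, which makes it easy to compare the smoothed a-posteriori SNR against the unsmoothed one on the same frame.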
Pages: 1161-1173
Number of pages: 13