Wavelet based speech presence probability estimator for speech enhancement

Cited by: 10
Authors
Lun, Daniel Pak-Kong [1]
Shen, Tak-Wai [1]
Hsung, Tai-Chiu [1]
Ho, Dominic K. C. [2]
Affiliations
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Ctr Signal Proc, Kowloon, Hong Kong, Peoples R China
[2] Univ Missouri, Dept Elect & Comp Engn, Columbia, MO USA
Keywords
Wavelet denoising; Multitaper spectrum estimation; Speech enhancement; Speech presence probability; SPECTRAL AMPLITUDE ESTIMATOR; VOICE ACTIVITY DETECTION; NOISE; SHRINKAGE; FILTER
DOI
10.1016/j.dsp.2012.06.011
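Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Subject classification code
0808; 0809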
Abstract
A reliable speech presence probability (SPP) estimator is important to many frequency-domain speech enhancement algorithms. It is known that a good estimate of SPP can be obtained from a smooth a-posteriori signal-to-noise ratio (SNR) function, which can be achieved by reducing the noise variance when estimating the speech power spectrum. Recently, wavelet denoising combined with multitaper spectrum (MTS) estimation was suggested for this purpose. However, traditional approaches directly use the wavelet shrinkage denoiser, which has not been fully optimized for denoising the MTS of noisy speech signals. In this paper, we first propose a two-stage wavelet denoising algorithm for estimating the speech power spectrum. In the first stage, we apply the wavelet transform to the periodogram of a noisy speech signal. Using the resulting wavelet coefficients, an oracle is developed to indicate the approximate locations of the noise floor in the periodogram. In the second stage, we use the oracle developed in stage 1 to selectively remove the wavelet coefficients of the noise floor in the log MTS of the noisy speech. The remaining wavelet coefficients are then used to reconstruct a denoised MTS, which in turn generates a smooth a-posteriori SNR function. To adapt to the enhanced a-posteriori SNR function, we further propose a new method to estimate the generalized likelihood ratio (GLR), which is an essential parameter for SPP estimation. Simulation results show that the new SPP estimator outperforms the traditional approaches and improves both the quality and the intelligibility of the enhanced speech. (C) 2012 Elsevier Inc. All rights reserved.
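As a rough illustration of the baseline the abstract refers to (plain wavelet shrinkage applied to the log MTS), the following Python sketch may help. It is not the authors' two-stage algorithm: the oracle built from the periodogram and the GLR estimator are not reproduced here. The sine tapers, the Haar wavelet, the universal threshold, the frame length, and all function names are assumptions chosen only to keep the example self-contained.

```python
import numpy as np

def sine_tapers(n, k):
    """Return k orthonormal sine tapers of length n, one per row (assumed taper family)."""
    idx = np.arange(1, n + 1)
    orders = np.arange(1, k + 1)[:, None]
    return np.sqrt(2.0 / (n + 1)) * np.sin(np.pi * orders * idx / (n + 1))

def multitaper_spectrum(frame, k=5):
    """Average the k tapered power spectra to lower the variance of the estimate."""
    tapers = sine_tapers(len(frame), k)
    spectra = np.abs(np.fft.rfft(tapers * frame, axis=1)) ** 2
    return spectra.mean(axis=0)

def haar_dwt(x, levels):
    """Orthonormal Haar DWT; returns (approximation, [details, finest level first])."""
    approx = np.asarray(x, dtype=float)
    details = []
    for _ in range(levels):
        even, odd = approx[0::2], approx[1::2]
        details.append((even - odd) / np.sqrt(2.0))
        approx = (even + odd) / np.sqrt(2.0)
    return approx, details

def haar_idwt(approx, details):
    """Inverse of haar_dwt."""
    for d in reversed(details):
        out = np.empty(2 * len(approx))
        out[0::2] = (approx + d) / np.sqrt(2.0)
        out[1::2] = (approx - d) / np.sqrt(2.0)
        approx = out
    return approx

def soft_threshold(c, t):
    """Shrink coefficients toward zero by t."""
    return np.sign(c) * np.maximum(np.abs(c) - t, 0.0)

def denoise_log_spectrum(spectrum, levels=3):
    """Generic wavelet shrinkage of the log spectrum (not the paper's oracle-based rule)."""
    log_s = np.log(spectrum + 1e-12)
    approx, details = haar_dwt(log_s, levels)
    # Noise scale from the finest details (median absolute deviation), universal threshold.
    sigma = np.median(np.abs(details[0])) / 0.6745
    t = sigma * np.sqrt(2.0 * np.log(len(log_s)))
    details = [soft_threshold(d, t) for d in details]
    return np.exp(haar_idwt(approx, details))

# Synthetic "noisy speech" frame: a sinusoid in white noise (illustrative only).
rng = np.random.default_rng(0)
n = 256
noise_var = 0.25
frame = np.sin(2 * np.pi * 0.05 * np.arange(n)) + np.sqrt(noise_var) * rng.standard_normal(n)

mts = multitaper_spectrum(frame)[:-1]   # drop the Nyquist bin so the length is 128
smooth = denoise_log_spectrum(mts)
post_snr = smooth / noise_var           # a-posteriori SNR, noise power assumed known
```

Because the sine tapers are orthonormal, white noise of variance `noise_var` has an expected MTS value of `noise_var` in each bin, so dividing by it gives a consistent a-posteriori SNR in this toy setting. In practice the noise power spectrum would itself have to be estimated.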
Pages: 1161-1173
Page count: 13
Related papers
50 in total
  • [11] Optimized Sigmoid Functions for Speech Presence Probability and Gain Function in Speech Enhancement
    Hai Huyen Dam
    Sven Nordholm
    Pei Chee Yong
    Siow Yong Low
    Circuits, Systems, and Signal Processing, 2024, 43 : 2891 - 2908
  • [12] Speech Enhancement Combining NMF Weighted by Speech Presence Probability and Statistical Model
    Hu, Yonggang
    Zhang, Xiongwei
    Zou, Xia
    Min, Gang
    Sun, Meng
    Zheng, Yunfei
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2015, E98A (12) : 2701 - 2704
  • [13] Robust Keyword Spotting for Noisy Environments by Leveraging Speech Enhancement and Speech Presence Probability
    Yang, Chouchang
    Saidutta, Yashas Malur
    Srinivasa, Rakshith Sharma
    Lee, Ching-Hua
    Shen, Yilin
    Jin, Hongxia
    INTERSPEECH 2023, 2023, : 1638 - 1642
  • [14] Speech enhancement based on morphology and wavelet filter
    Wang, X
    Tang, HM
    Zhao, XQ
    ICEMI 2005: CONFERENCE PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL 6, 2005, : 237 - 240
  • [15] A multichannel subspace approach with signal presence probability for speech enhancement
    Jungpyo Hong
    Multidimensional Systems and Signal Processing, 2019, 30 : 2045 - 2058
  • [16] HMM-based noise estimator for speech enhancement
    许春冬
    夏日升
    应冬文
    李军锋
    颜永红
    Journal of Beijing Institute of Technology, 2014, 23 (04) : 549 - 556
  • [17] AdaBoost Noise Estimator for Subspace based Speech Enhancement
    Dahlan, Rico
    2018 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2018, : 110 - 113
  • [18] HMM-based noise estimator for speech enhancement
    Xia, Ri-Sheng (xiarisheng@hccl.ioa.ac.cn), 1600, Beijing Institute of Technology (23):
  • [19] A Laplacian-based MMSE estimator for speech enhancement
    Chen, Bin
    Loizou, Philipos C.
    SPEECH COMMUNICATION, 2007, 49 (02) : 134 - 143
  • [20] A multichannel subspace approach with signal presence probability for speech enhancement
    Hong, Jungpyo
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2019, 30 (04) : 2045 - 2058