Wavelet based speech presence probability estimator for speech enhancement

Cited by: 10
Authors
Lun, Daniel Pak-Kong [1]
Shen, Tak-Wai [1]
Hsung, Tai-Chiu [1]
Ho, Dominic K. C. [2]
Affiliations
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Ctr Signal Proc, Kowloon, Hong Kong, Peoples R China
[2] Univ Missouri, Dept Elect & Comp Engn, Columbia, MO USA
Keywords
Wavelet denoising; Multitaper spectrum estimation; Speech enhancement; Speech presence probability; Spectral amplitude estimator; Voice activity detection; Noise; Shrinkage; Filter
DOI
10.1016/j.dsp.2012.06.011
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
A reliable speech presence probability (SPP) estimator is important to many frequency-domain speech enhancement algorithms. It is known that a good estimate of SPP can be obtained from a smooth a-posteriori signal-to-noise ratio (SNR) function, which can be achieved by reducing the noise variance when estimating the speech power spectrum. Recently, wavelet denoising combined with multitaper spectrum (MTS) estimation was suggested for this purpose. However, traditional approaches directly use a wavelet shrinkage denoiser that has not been fully optimized for denoising the MTS of noisy speech signals. In this paper, we first propose a two-stage wavelet denoising algorithm for estimating the speech power spectrum. In the first stage, we apply the wavelet transform to the periodogram of a noisy speech signal. Using the resulting wavelet coefficients, an oracle is developed to indicate the approximate locations of the noise floor in the periodogram. In the second stage, we use this oracle to selectively remove the wavelet coefficients of the noise floor in the log MTS of the noisy speech. The remaining wavelet coefficients are then used to reconstruct a denoised MTS, which in turn generates a smooth a-posteriori SNR function. To adapt to the enhanced a-posteriori SNR function, we further propose a new method to estimate the generalized likelihood ratio (GLR), which is an essential parameter for SPP estimation. Simulation results show that the new SPP estimator outperforms the traditional approaches and improves both the quality and intelligibility of the enhanced speech. © 2012 Elsevier Inc. All rights reserved.
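To illustrate why a smooth a-posteriori SNR matters for SPP, the following is a minimal sketch of the textbook relation between the GLR and the SPP under a complex-Gaussian model of the spectral coefficients. It is not the GLR estimator proposed in the paper. The function name `speech_presence_probability` and the parameters `gamma` (a-posteriori SNR), `xi` (a-priori SNR) and `prior_speech` (prior probability of speech presence) are assumptions chosen for this example.

```python
import math


def speech_presence_probability(gamma, xi, prior_speech=0.5):
    """Textbook SPP for one time-frequency bin (Gaussian model, illustrative only).

    gamma        -- a-posteriori SNR, |Y|^2 / noise power (linear scale, >= 0)
    xi           -- a-priori SNR, speech power / noise power (linear scale, >= 0)
    prior_speech -- assumed prior probability that speech is present, in (0, 1)
    """
    if not 0.0 < prior_speech < 1.0:
        raise ValueError("prior_speech must lie strictly between 0 and 1")
    if gamma < 0.0 or xi < 0.0:
        raise ValueError("gamma and xi must be non-negative")

    # Log of the generalized likelihood ratio:
    #   GLR = prior odds * exp(gamma * xi / (1 + xi)) / (1 + xi)
    log_glr = (
        math.log(prior_speech / (1.0 - prior_speech))
        - math.log1p(xi)
        + gamma * xi / (1.0 + xi)
    )

    # SPP = GLR / (1 + GLR), evaluated in a numerically stable way.
    if log_glr >= 0.0:
        return 1.0 / (1.0 + math.exp(-log_glr))
    glr = math.exp(log_glr)
    return glr / (1.0 + glr)


if __name__ == "__main__":
    # With xi held fixed, the SPP rises with the a-posteriori SNR.
    for gamma in (0.0, 1.0, 4.0, 10.0):
        print(gamma, round(speech_presence_probability(gamma, xi=1.0), 4))
```

Because the SPP depends exponentially on `gamma` in this model, random fluctuations in the a-posteriori SNR translate directly into fluctuations of the SPP, which is consistent with the abstract's emphasis on obtaining a smooth a-posteriori SNR function.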
Pages: 1161 - 1173
Number of pages: 13
Related papers
50 in total
  • [1] Speech enhancement via two-stage dual tree complex wavelet packet transform with a speech presence probability estimator
    Sun, Pengfei
    Qin, Jun
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (02): : 808 - 817
  • [2] Speech enhancement using voiced speech probability based wavelet decomposition
    Bhowmick, Anirban
    Chandra, Mahesh
    COMPUTERS & ELECTRICAL ENGINEERING, 2017, 62 : 706 - 718
  • [3] Binaural Codebook-Based Speech Enhancement With Atomic Speech Presence Probability
    Wood, Sean U. N.
    Stahl, Johannes K. W.
    Mowlaee, Pejman
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2150 - 2161
  • [4] A SPEECH PRESENCE PROBABILITY ESTIMATOR BASED ON FIXED PRIORS AND A HEAVY-TAILED SPEECH MODEL
    Fodor, Balazs
    Gerkmann, Timo
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 2305 - 2309
  • [5] Improved Speech Presence Probability Estimation Based on Wavelet Denoising
    Lun, Daniel Pak-Kong
    Shen, Tak-Wai
    Hsung, Tai-Chiu
    Ho, Dominic K. C.
    2012 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 2012), 2012, : 1018 - 1021
  • [6] Unsupervised Speech Enhancement Using Optimal Transport and Speech Presence Probability
    Jiang, Wenbin
    Yu, Kai
    Wen, Fei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4445 - 4455
  • [7] Codebook-Based Speech Enhancement Using Markov Process and Speech-presence Probability
    He, Qi
    Bao, Chang-chun
    Bao, Feng
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1780 - 1784
  • [8] Generalization of Maximum A Posteriori Amplitude Estimator Under Speech Presence Uncertainty for Speech Enhancement
    Hajar Momeni
    Hamid Reza Abutalebi
    Circuits, Systems, and Signal Processing, 2014, 33 : 2565 - 2582
  • [9] Generalization of Maximum A Posteriori Amplitude Estimator Under Speech Presence Uncertainty for Speech Enhancement
    Momeni, Hajar
    Abutalebi, Hamid Reza
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2014, 33 (08) : 2565 - 2582
  • [10] Optimized Sigmoid Functions for Speech Presence Probability and Gain Function in Speech Enhancement
    Dam, Hai Huyen
    Nordholm, Sven
    Yong, Pei Chee
    Low, Siow Yong
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (05) : 2891 - 2908