Speech enhancement of non-stationary noise based on Controlled Forward Moving Average

被引:1
|
作者
Farrokhi, Dariush [1 ]
Togneri, Roberto [1 ]
Zaknich, Anthony [1 ]
机构
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Nedlands, WA 6009, Australia
关键词
controlled forward moving average; discrete or prolate spheroidal sequence multi-taper method; noise estimation algorithm; speech enhancement; wavelet thresholding;
D O I
10.1109/ISCIT.2007.4392263
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A pre and post processing technique is proposed to enhance the speech signal of highly non-stationary noisy speech. The purpose of this research has been to build on current speech enhancement algorithms to produce an improved algorithm for enhancement of speech contaminated with non-stationary babble type noise. The pre processing involves two stages. In stage one, the variance of the noisy speech spectrum is reduced by utilizing the Discrete or Prolate Spheroidal Sequence (DPSS) multi-taper algorithm plus a Controlled Forward Moving Average (CFMA) technique. We introduced the CFMA algorithm to smooth and reduce variance of the estimated non-stationary noise spectrum. In the second stage the noisy speech power spectrum is de-noised by applying Stein's Unbiased Risk Estimator (SURE) wavelet thresholding technique. In the third layer, use is made of a noise estimation algorithm with rapid adaptation for a highly non-stationary noise environment. The noise estimate is updated in three frequency sub-bands, by averaging the noisy speech power spectrum using a frequency dependent smoothing factor, which is adjusted, based on a signal presence probability factor. In the fourth layer a spectral subtraction algorithm is used to enhance the speech signal, by subtracting each estimated noise from the original noisy speech. The new proposed post processing is then applied to the complete signal when the speech enhancement is processed using segmental speech enhancement. The enhanced signal is further improved by applying a soft wavelet thresholding technique to the un-segmented enhanced speech at the final processing stage. The results show improvements both quantitatively and qualitatively compared to the speech enhancement that does not apply the CFMA algorithm.
引用
收藏
页码:1551 / 1555
页数:5
相关论文
共 50 条
  • [1] Speech enhancement for non-stationary noise environments
    Cohen, I
    Berdugo, B
    [J]. SIGNAL PROCESSING, 2001, 81 (11) : 2403 - 2418
  • [2] Enhancement and Noise Statistics Estimation for Non-Stationary Voiced Speech
    Norholm, Sidsel Marie
    Jensen, Jesper Rindom
    Christensen, Mads Grsboll
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (04) : 645 - 658
  • [3] SPARSE HMM-BASED SPEECH ENHANCEMENT METHOD FOR STATIONARY AND NON-STATIONARY NOISE ENVIRONMENTS
    Deng, Feng
    Bao, Chang-chun
    Kleijn, W. Bastiaan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5073 - 5077
  • [4] Speech Enhancement in Non-Stationary Noise Using Compressive Sensing
    Sulong, Amart
    Gunawan, Teddy Surya
    Khalifa, Othman O.
    Kartiwi, Mira
    [J]. PROCEEDINGS OF 6TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE 2016), 2016, : 489 - 493
  • [5] Voicing detection based on adaptive aperiodicity thresholding for speech enhancement in non-stationary noise
    Cabanas-Molero, Pablo
    Martinez-Munoz, Damian
    Vera-Candeas, Pedro
    Ruiz-Reyes, Nicolas
    Jose Rodriguez-Serrano, Francisco
    [J]. IET SIGNAL PROCESSING, 2014, 8 (02) : 119 - 130
  • [6] USING A REMOTE WIRELESS MICROPHONE FOR SPEECH ENHANCEMENT IN NON-STATIONARY NOISE
    Srinivasan, Sriram
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5088 - 5091
  • [7] Robust Estimation of Non-Stationary Noise Power Spectrum for Speech Enhancement
    Mai, Van-Khanh
    Pastor, Dominique
    Aissa-El-Bey, Abdeldjalil
    Le-Bidan, Raphael
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (04) : 670 - 682
  • [8] Single Channel Speech Enhancement for Mixed Non-stationary Noise Environments
    Singh, Sachin
    Tripathy, Manoj
    Anand, R. S.
    [J]. ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 545 - 555
  • [9] Speech enhancement for non-stationary noise environment by adaptive wavelet packet
    Chang, S
    Kwon, Y
    Yang, SI
    Kim, IJ
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 561 - 564
  • [10] Sparse Hidden Markov Models for Speech Enhancement in Non-Stationary Noise Environments
    Deng, Feng
    Bao, Changchun
    Kleijn, W. Bastiaan
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1973 - 1987