Speech enhancement of non-stationary noise based on Controlled Forward Moving Average

Cited by: 1

Authors:

Farrokhi, Dariush [1]

Togneri, Roberto [1]

Zaknich, Anthony [1]

Affiliation:

[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Nedlands, WA 6009, Australia

Source:

2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3 | 2007

Keywords:

controlled forward moving average; discrete or prolate spheroidal sequence multi-taper method; noise estimation algorithm; speech enhancement; wavelet thresholding

DOI:

10.1109/ISCIT.2007.4392263

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification code:

0808 ; 0809 ;

Abstract:

A pre- and post-processing technique is proposed to enhance speech corrupted by highly non-stationary noise. The research builds on current speech enhancement algorithms to produce an improved algorithm for speech contaminated with non-stationary babble-type noise. The pre-processing involves two stages. In the first stage, the variance of the noisy speech spectrum is reduced using the Discrete or Prolate Spheroidal Sequence (DPSS) multi-taper algorithm together with a Controlled Forward Moving Average (CFMA) technique. The CFMA algorithm is introduced to smooth the estimated non-stationary noise spectrum and reduce its variance. In the second stage, the noisy speech power spectrum is de-noised by applying Stein's Unbiased Risk Estimator (SURE) wavelet thresholding. In the third layer, a noise estimation algorithm with rapid adaptation is used for the highly non-stationary noise environment. The noise estimate is updated in three frequency sub-bands by averaging the noisy speech power spectrum with a frequency-dependent smoothing factor, which is adjusted according to a signal presence probability factor. In the fourth layer, a spectral subtraction algorithm enhances the speech signal by subtracting each estimated noise from the original noisy speech. The proposed post-processing is then applied to the complete signal when the enhancement has been carried out segment by segment. The enhanced signal is further improved by applying soft wavelet thresholding to the un-segmented enhanced speech at the final processing stage. The results show both quantitative and qualitative improvements over speech enhancement that does not apply the CFMA algorithm.
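The abstract names three generic building blocks without giving formulas: a noise estimate updated by probability-controlled averaging, spectral subtraction, and soft thresholding. The sketch below illustrates textbook forms of these steps only; it is not the authors' implementation and does not attempt to reproduce CFMA, the DPSS multi-taper stage, or SURE threshold selection. The function names, the per-bin (rather than three sub-band) update, the `base_alpha` smoothing constant, and the spectral `floor` are all assumptions made for illustration.

```python
import math
from typing import List


def update_noise_estimate(prev_noise: List[float],
                          noisy_power: List[float],
                          speech_prob: List[float],
                          base_alpha: float = 0.85) -> List[float]:
    """One recursive-averaging step per frequency bin (assumed form).

    The smoothing factor moves toward 1 when speech is likely present,
    which holds the previous noise estimate; it falls to base_alpha when
    speech is likely absent, so the estimate tracks the noisy spectrum.
    """
    updated = []
    for n_prev, y, p in zip(prev_noise, noisy_power, speech_prob):
        alpha = base_alpha + (1.0 - base_alpha) * p
        updated.append(alpha * n_prev + (1.0 - alpha) * y)
    return updated


def spectral_subtract(noisy_power: List[float],
                      noise_power: List[float],
                      floor: float = 0.01) -> List[float]:
    """Power spectral subtraction with a floor (hypothetical value) that
    keeps every bin positive after the noise estimate is removed."""
    return [max(y - n, floor * y) for y, n in zip(noisy_power, noise_power)]


def soft_threshold(values: List[float], threshold: float) -> List[float]:
    """Soft thresholding: shrink each coefficient toward zero by
    `threshold` and zero out anything whose magnitude is below it."""
    out = []
    for v in values:
        mag = abs(v) - threshold
        out.append(math.copysign(mag, v) if mag > 0 else 0.0)
    return out


# Toy three-bin frame: speech is assumed present only in the middle bin.
noisy = [4.0, 9.0, 1.0]
noise = update_noise_estimate([1.0, 1.0, 1.0], noisy, [0.0, 1.0, 0.0],
                              base_alpha=0.5)
enhanced = spectral_subtract(noisy, noise)
shrunk = soft_threshold([3.0, -0.5, -2.0], 1.0)
```

In the toy frame, the middle bin keeps its previous noise value because the speech presence probability is 1, while the outer bins move halfway toward the noisy power. In practice the thresholding would be applied to wavelet coefficients of the signal or spectrum rather than to raw values as shown here.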

Citation

Pages: 1551-1555

Number of pages: 5
