A generalized time-frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system

Cited by: 28

Authors:

Shao, Yu [1]

Chang, Chip-Hong [1]

Affiliations:

[1] Nanyang Technol Univ, Ctr High Performance Embedded Syst, Singapore 63755, Singapore

Source:

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2007 / Vol. 37 / No. 04

Keywords:

auditory masking; noise reduction; speech enhancement; wavelet;

DOI:

10.1109/TSMCB.2007.895365

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized time-frequency subtraction algorithm, which advantageously exploits the wavelet multirate signal representation to preserve the critical transient information. Simultaneous masking and temporal masking of the human auditory system are modeled by the perceptual wavelet packet transform via the frequency and temporal localization of speech components. The wavelet coefficients are used to calculate the Bark spreading energy and temporal spreading energy, from which a time-frequency masking threshold is deduced to adaptively adjust the subtraction parameters of the proposed method. An unvoiced speech enhancement algorithm is also integrated into the system to improve the intelligibility of speech. Through rigorous objective and subjective evaluations, it is shown that the proposed speech enhancement system is capable of reducing noise with little speech degradation in adverse noise environments, and that the overall performance is superior to several competitive methods.
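
As a rough illustration of the masking-adapted subtraction idea described in the abstract, the sketch below applies a generic parametric subtraction rule to a vector of subband coefficients. It is not the authors' algorithm: the function names (`adapt_parameter`, `generalized_subtraction`), the linear mapping from masking threshold to parameters, the parameter ranges, and the exponent `gamma` are all assumptions made for illustration. The wavelet packet transform, the noise estimate, and the masking threshold are treated as given inputs.

```python
import numpy as np


def adapt_parameter(thr, p_min, p_max):
    """Map thresholds to a parameter (assumed linear rule).

    A high masking threshold (noise is well masked) yields p_min;
    a low threshold (noise is audible) yields p_max.
    """
    thr = np.asarray(thr, dtype=float)
    t_min, t_max = thr.min(), thr.max()
    if t_max == t_min:
        # Flat threshold: fall back to the midpoint of the range.
        return np.full_like(thr, 0.5 * (p_min + p_max))
    norm = (thr - t_min) / (t_max - t_min)
    return p_max - norm * (p_max - p_min)


def generalized_subtraction(coeffs, noise_mag, mask_thr, gamma=2.0,
                            alpha_range=(1.0, 6.0), beta_range=(0.0, 0.02)):
    """Subtract a noise estimate from subband coefficients.

    coeffs    : noisy subband (e.g. wavelet packet) coefficients
    noise_mag : estimated noise magnitude per coefficient
    mask_thr  : masking threshold per coefficient (hypothetical input)
    gamma     : exponent of the subtraction domain (2.0 = power domain)
    """
    coeffs = np.asarray(coeffs, dtype=float)
    noise_mag = np.asarray(noise_mag, dtype=float)

    # Oversubtraction factor and noise floor, both driven by the threshold.
    alpha = adapt_parameter(mask_thr, *alpha_range)
    beta = adapt_parameter(mask_thr, *beta_range)

    noisy_pow = np.abs(coeffs) ** gamma
    noise_pow = noise_mag ** gamma

    subtracted = noisy_pow - alpha * noise_pow
    floor = beta * noise_pow

    # Keep the subtracted value where it exceeds the floor, else use the floor.
    clean_pow = np.where(subtracted > floor, subtracted, floor)

    # Restore the sign of the original coefficient.
    return np.sign(coeffs) * clean_pow ** (1.0 / gamma)


# Toy usage: one strong coefficient with a high threshold,
# one weak coefficient with a low threshold.
enhanced = generalized_subtraction(
    coeffs=[4.0, -0.5], noise_mag=[1.0, 1.0], mask_thr=[10.0, 0.0]
)
```

In this sketch, coefficients whose noise is already masked are subtracted gently, while coefficients with audible noise are subtracted aggressively and clamped to a small floor, which is one common way to trade residual noise against speech distortion.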


Pages: 877-889

Page count: 13

