A generalized time-frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system

Cited by: 28
Authors
Shao, Yu [1]
Chang, Chip-Hong [1]
Affiliations
[1] Nanyang Technol Univ, Ctr High Performance Embedded Syst, Singapore 63755, Singapore
Keywords
auditory masking; noise reduction; speech enhancement; wavelet
DOI
10.1109/TSMCB.2007.895365
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized time-frequency subtraction algorithm, which advantageously exploits the wavelet multirate signal representation to preserve the critical transient information. Simultaneous masking and temporal masking of the human auditory system are modeled by the perceptual wavelet packet transform via the frequency and temporal localization of speech components. The wavelet coefficients are used to calculate the Bark spreading energy and temporal spreading energy, from which a time-frequency masking threshold is deduced to adaptively adjust the subtraction parameters of the proposed method. An unvoiced speech enhancement algorithm is also integrated into the system to improve the intelligibility of speech. Through rigorous objective and subjective evaluations, it is shown that the proposed speech enhancement system is capable of reducing noise with little speech degradation in adverse noise environments, and that its overall performance is superior to several competitive methods.
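The abstract does not give the subtraction rule itself, so the following is only a rough sketch of how a masking threshold could steer a generalized subtraction step on subband coefficients. Everything named here is an assumption for illustration: the function names `adapt_alpha` and `generalized_subtract`, the linear mapping from threshold to over-subtraction factor, the default values of `alpha_min`, `alpha_max`, `beta` and `gamma`, and the use of a single scalar noise magnitude. It is not the authors' algorithm.

```python
import math


def adapt_alpha(threshold, t_min, t_max, alpha_min=1.0, alpha_max=6.0):
    """Map a masking threshold to an over-subtraction factor.

    Assumption: a simple linear rule. A high threshold means the noise is
    largely masked, so less subtraction (smaller alpha) is applied.
    """
    if t_max <= t_min:
        return alpha_max
    frac = (threshold - t_min) / (t_max - t_min)
    frac = min(1.0, max(0.0, frac))  # clamp to [0, 1]
    return alpha_max - frac * (alpha_max - alpha_min)


def generalized_subtract(coeffs, noise_mag, alphas, beta=0.01, gamma=2.0):
    """Apply generalized subtraction to real-valued subband coefficients.

    For each coefficient y with over-subtraction factor a:
        |s|^gamma = max(|y|^gamma - a * noise^gamma, beta * noise^gamma)
    The sign of y is kept. beta sets a floor that limits musical noise.
    """
    noise_pow = noise_mag ** gamma
    floor = beta * noise_pow
    out = []
    for y, a in zip(coeffs, alphas):
        residual = abs(y) ** gamma - a * noise_pow
        clean_pow = residual if residual > floor else floor
        out.append(math.copysign(clean_pow ** (1.0 / gamma), y))
    return out


# Hypothetical usage: two coefficients, thresholds normalized to [0, 1].
thresholds = [1.0, 1.0]
alphas = [adapt_alpha(t, 0.0, 1.0) for t in thresholds]
enhanced = generalized_subtract([3.0, -0.5], 1.0, alphas)
```

In this sketch the strong coefficient is only mildly attenuated, while the coefficient below the noise estimate is pushed down to the floor rather than zeroed, which is the usual motivation for the `beta` term.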
Pages: 877-889
Page count: 13