A Wavelet-Based Denoising System Using Time-Frequency Adaptation for Speech Enhancement

被引:1
|
作者
Wang, Kun-Ching [1 ]
Chin, Chuin-Li [2 ]
Tsai, Yi-Hsing [3 ]
机构
[1] Shin Chien Univ, Dept Informat Technol & Commun, Kaohsiung, Taiwan
[2] Chung Shan Med Univ, Dept Appl Informat Sci, Taichung, Taiwan
[3] Ind Technol Res Inst, Informat & Commun Res Lab, Hsinchu, Taiwan
关键词
wavelet denoising system; time-frequency adaptation; voiced/unvoiced decision; speech denoising;
D O I
10.1109/IALP.2009.32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel wavelet denoising system using time-frequency adaptation for providing speech enhancement robustness to non-stationary and colored noise. Different from the conventional methods in threshold choosing, e.g. invariant threshold and time-variant threshold, the proposed wavelet coefficient threshold (WCT) is adapted by both time and frequency information. In order to further improve the intelligibility of the processed speech signal, we apply appropriate wavelet thresholding according to voiced/unvoiced decision. Simulation results showed that the proposed system is capable of reducing noise with little speech degradation and the overall performance is superior to several competitive methods in both objective and subjective evaluations.
引用
收藏
页码:114 / 117
页数:4
相关论文
共 50 条
  • [41] Differential neural responses to acupuncture revealed by MEG using wavelet-based time-frequency analysis: A pilot study
    You, Youbo
    Bai, Lijun
    Dai, Ruwei
    Xue, Ting
    Zhong, Chongguang
    Feng, Yuanyuan
    Wang, Hu
    Liu, Zhenyu
    Tian, Jie
    2011 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2011, : 7099 - 7102
  • [42] Joint Time-Frequency and Time Domain Learning for Speech Enhancement
    Tang, Chuanxin
    Luo, Chong
    Zhao, Zhiyuan
    Xie, Wenxuan
    Zeng, Wenjun
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3816 - 3822
  • [43] Time-frequency localization method for singular signal detection using wavelet-based Holder exponent and Hilbert transform
    Deng, Xiaoyan
    Wang, Qiaohua
    Chen, Xiaokun
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 4, PROCEEDINGS, 2008, : 266 - +
  • [44] Speech enhancement based on adaptive wavelet denoising on multitaper spectrum
    Hsung, Tai-Chiu
    Lun, Daniel Pak-Kong
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 1700 - 1703
  • [45] Nonstationary Vibration Signal Analysis Using Wavelet-Based Time-Frequency Filter and Wigner-Ville Distribution
    Xu, Chang
    Wang, Cong
    Liu, Wei
    JOURNAL OF VIBRATION AND ACOUSTICS-TRANSACTIONS OF THE ASME, 2016, 138 (05):
  • [46] Dual Channel Coherence Based Speech Enhancement with Wavelet Denoising
    Bagekar, Snehal
    Tank, Vanita
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1826 - 1830
  • [47] A Time-Frequency Attention Module for Neural Speech Enhancement
    Zhang, Qiquan
    Qian, Xinyuan
    Ni, Zhaoheng
    Nicolson, Aaron
    Ambikairajah, Eliathamby
    Li, Haizhou
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 462 - 475
  • [48] Integrated speech enhancement and coding in the time-frequency domain
    Drygajlo, A
    Carnero, B
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1183 - 1186
  • [49] Adaptive time-frequency data fusion for speech enhancement
    Shi, G
    Aarabi, P
    Lazic, N
    FUSION 2003: PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE OF INFORMATION FUSION, VOLS 1 AND 2, 2003, : 394 - 399
  • [50] A time-frequency smoothing neural network for speech enhancement
    Yuan, Wenhao
    SPEECH COMMUNICATION, 2020, 124 : 75 - 84