Speech enhancement based on adaptive wavelet denoising on multitaper spectrum

被引:0
|
作者
Hsung, Tai-Chiu [1 ]
Lun, Daniel Pak-Kong [1 ]
机构
[1] Hong Kong Polytech Univ, Ctr Multimedia Signal Proc, Elect & Informat Engn Dept, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
10.1109/ISCAS.2008.4541764
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Classical speech enhancement algorithms often require a good estimation of the short-time power spectrum using, for instance, the periodogram methods. However, it is well known that traditional periodogram methods are prone to induce large variance, hence produces the "musical noise" after enhancement. To alleviate this problem, multitaper spectrum (NITS) estimators with wavelet denoising were proposed. In this paper, we investigate the properties of the MTS of noisy speech signals. We find that, in the log NITS domain, the variance of noise varies according to the magnitude of the underlying speech spectrum. It implies that when applying wavelet denoising to the log NITS, the constant threshold used in the traditional methods is not appropriate. Based on this observation, we further develop a wavelet denoising method with adaptive threshold for estimating power spectrum using multitaper. Simulation results show that the spectrum estimated using the proposed method is consistently more accurate than the traditional uniform thresholding methods. Hence, it further improves the current speech enhancement algorithms using the MTS approaches.
引用
下载
收藏
页码:1700 / 1703
页数:4
相关论文
共 50 条
  • [1] Speech enhancement based on wavelet thresholding the multitaper spectrum
    Hu, Y
    Loizou, PC
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (01): : 59 - 67
  • [2] Psychoacoustical enhancement of speech based on multitaper spectrum
    School of Information Science and Engineering, Southeast University, Nanjing 210096, China
    不详
    Shengxue Xuebao, 2007, 3 (275-281):
  • [3] Adaptive wavelet denoising system for speech enhancement
    Xu, Lan
    Kwan, Hon Keung
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 3210 - 3213
  • [4] Speech enhancement based on multitaper spectrum and psychoacoustical weighting rule
    WU Hongwei WU Zhenyang ZHAO Li ( College of Information Science and Engineering
    Chinese Journal of Acoustics, 2007, (03) : 278 - 288
  • [5] A modified Wiener filtering method combined with wavelet thresholding multitaper spectrum for speech enhancement
    Yanna Ma
    Akinori Nishihara
    EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [6] A modified Wiener filtering method combined with wavelet thresholding multitaper spectrum for speech enhancement
    Ma, Yanna
    Nishihara, Akinori
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 11
  • [7] NMF-Based Speech Enhancement Using Multitaper Spectrum Estimation
    Attabi, Yazid
    Chung, Hanwook
    Champagne, Benoit
    Zhu, Wei-Ping
    2018 INTERNATIONAL CONFERENCE ON SIGNALS AND SYSTEMS (ICSIGSYS), 2018, : 36 - 41
  • [8] Dual Channel Coherence Based Speech Enhancement with Wavelet Denoising
    Bagekar, Snehal
    Tank, Vanita
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1826 - 1830
  • [9] Wavelet based adaptive algorithm for mammographic images enhancement and denoising
    Mencattini, A
    Caselli, F
    Salmeri, M
    Lojacono, R
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 857 - 860
  • [10] IQ Evaluation Based Adaptive Wavelet Denoising and Enhancement for a VTRAN System
    Liu, Haoting
    Lu, Hanqing
    2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, : 594 - 599