SPEECH ENHANCEMENT BASED ON JOINT TIME-FREQUENCY SEGMENTATION

被引:2
|
作者
Tantibundhit, C. [1 ,2 ]
Pernkopf, F. [2 ]
Kubin, G. [2 ]
机构
[1] Thammasat Univ, Med Intelligence & Innovat Lab, Bangkok, Thailand
[2] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
关键词
Speech enhancement; transient component; speech intelligibility; wavelet packet transform; SET;
D O I
10.1109/ICASSP.2009.4960673
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present an algorithm to decompose speech into transient and non-transient components. Our algorithm, the joint time-frequency segmentation algorithm, uses the wavelet packet coefficients of the speech signal and represents them as tiles of a time-frequency representation adapted to the characteristics of the signal itself. Any wavelet packet coefficient, whose tiling height is larger than or equal to the tiling width is characterized as a transient coefficient and vice versa for the non-transient coefficient. The transient component is selectively amplified and recombined with the original speech to generate the modified speech with energy adjusted to be equal to the energy of the original speech. The psychoacoustic tests performed with fourteen human listeners show that the speech modification significantly improves speech intelligibility in background noise, i.e., for 10% absolute at 0dB to 31% absolute at - 30dB.
引用
收藏
页码:4673 / +
页数:2
相关论文
共 50 条
  • [21] Speech enhancement with natural sounding residual noise based on connected time-frequency speech presence regions
    Sorensen, KV
    Andersen, SV
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (18) : 2954 - 2964
  • [22] An Effective Target Speech Enhancement with Single Acoustic Vector Sensor Based on the Speech Time-Frequency Sparsity
    Zou, Y. X.
    Wang, Y. Q.
    Wang, Peng
    Ritz, C. H.
    Xi, Jiangtao
    [J]. 2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 547 - 551
  • [23] Speech Enhancement with Natural Sounding Residual Noise Based on Connected Time-Frequency Speech Presence Regions
    Karsten Vandborg Sørensen
    Søren Vang Andersen
    [J]. EURASIP Journal on Advances in Signal Processing, 2005
  • [24] PHASE RECONSTRUCTION METHOD BASED ON TIME-FREQUENCY DOMAIN HARMONIC STRUCTURE FOR SPEECH ENHANCEMENT
    Wakabayashi, Yukoh
    Fukumori, Takahiro
    Nakayama, Masato
    Nishiura, Takanobu
    Yamashita, Yoichi
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5560 - 5564
  • [25] TIME-FREQUENCY MASKING-BASED SPEECH ENHANCEMENT USING GENERATIVE ADVERSARIAL NETWORK
    Soni, Meet H.
    Shah, Neil
    Patil, Hemant A.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5039 - 5043
  • [26] A Wavelet-Based Denoising System Using Time-Frequency Adaptation for Speech Enhancement
    Wang, Kun-Ching
    Chin, Chuin-Li
    Tsai, Yi-Hsing
    [J]. 2009 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2009, : 114 - 117
  • [27] SPEECH PRESENCE PROBABILITY ESTIMATION BASED ON INTEGRATED TIME-FREQUENCY MINIMUM TRACKING FOR SPEECH ENHANCEMENT IN ADVERSE ENVIRONMENTS
    Fu, Zhong-Hua
    Wang, Jhing-Fa
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4258 - 4261
  • [28] A time-frequency fusion model for multi-channel speech enhancement
    Zeng, Xiao
    Xu, Shiyun
    Wang, Mingjiang
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01):
  • [29] Deep Learning Assisted Time-Frequency Processing for Speech Enhancement on Drones
    Wang, Lin
    Cavallaro, Andrea
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2021, 5 (06): : 871 - 881
  • [30] Single channel speech enhancement via time-frequency dictionary learning
    HUANG Jianjun
    ZHANG Xiongwei
    ZHANG Yafei
    ZOU Xia
    [J]. Chinese Journal of Acoustics, 2013, 32 (01) : 90 - 102