A comprehensive noise robust speech parameterization algorithm using wavelet packet decomposition-based denoising and speech feature representation techniques

被引:0
|
作者
Kotnik, Bojan [1 ]
Kacic, Zdravko [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, SLO-2000 Maribor, Slovenia
关键词
D O I
10.1155/2007/64102
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper concerns the problem of automatic speech recognition in noise-intense and adverse environments. The main goal of the proposed work is the definition, implementation, and evaluation of a novel noise robust speech signal parameterization algorithm. The proposed procedure is based on time-frequency speech signal representation using wavelet packet decomposition. A new modified soft thresholding algorithm based on time-frequency adaptive threshold determination was developed to efficiently reduce the level of additive noise in the input noisy speech signal. A two-stage Gaussian mixture model (GMM)-based classifier was developed to perform speech/nonspeech as well as voiced/unvoiced classification. The adaptive topology of the wavelet packet decomposition tree based on voiced/unvoiced detection was introduced to separately analyze voiced and unvoiced segments of the speech signal. The main feature vector consists of a combination of log-root compressed wavelet packet parameters, and autoregressive parameters. The final output feature vector is produced using a two-staged feature vector postprocessing procedure. In the experimental framework, the noisy speech databases Aurora 2 and Aurora 3 were applied together with corresponding standardized acoustical model training/testing procedures. The automatic speech recognition performance achieved using the proposed noise robust speech parameterization procedure was compared to the standardized mel-frequency cepstral coefficient ( MFCC) feature extraction procedures ETSI ES 201 108 and ETSI ES 202 050. Copyright (C) 2007 B. Kotnik and Z. Kacic.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Subband Feature Statistics Normalization Techniques Based on a Discrete Wavelet Transform for Robust Speech Recognition
    Hung, Jeih-weih
    Fan, Hao-Teng
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (09) : 806 - 809
  • [32] Speech denoising based on Group Sparse Representation in the case of Gaussian Noise
    Liu, Hongqing
    Liu, Shujun
    Li, Yong
    Li, Dong
    Trieu-Kien Truong
    [J]. 2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [33] Speech recognition using a proposed indexing tree algorithm based on wavelet packet transform
    Mabmoud, W.A.
    Juda, N.R.
    Gagad, N.H.
    [J]. Advances in Modelling and Analysis B, 2003, 46 (3-4): : 25 - 36
  • [34] Sequential MAP estimation based speech feature enhancement for noise robust speech recognition
    Jia, C
    Ding, P
    Xu, B
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 412 - 415
  • [35] An Adaptive Wavelet-Based Denoising Algorithm for Enhancing Speech in Non-stationary Noise Environment
    Wang, Kun-Ching
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (02): : 341 - 349
  • [36] A New Speech Denoising Algorithm Based on Bilateral Filtering and Wavelet Transform
    Liu, Caixia
    Hou, Yanyan
    Yang, Bin
    [J]. PROCEEDINGS OF THE 2017 4TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND COMPUTER (MACMC 2017), 2017, 150 : 649 - 653
  • [37] Front-end Feature Compensation and Denoising for Noise Robust Speech Emotion Recognition
    Chakraborty, Rupayan
    Panda, Ashish
    Pandharipande, Meghna
    Joshi, Sonal
    Kopparapu, Sunil Kumar
    [J]. INTERSPEECH 2019, 2019, : 3257 - 3261
  • [39] Silence and speech segmentation for noisy speech using a wavelet based algorithm
    Mei, XD
    Sun, SH
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2001, 10 (04) : 439 - 443
  • [40] Speech enhancement using adaptive threshold based on bi-orthogonal wavelet packet decomposition
    Li, Ruwei
    Bao, Changchun
    Dou, Huijing
    [J]. Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2008, 29 (10): : 2135 - 2140