A comprehensive noise robust speech parameterization algorithm using wavelet packet decomposition-based denoising and speech feature representation techniques

被引:0
|
作者
Kotnik, Bojan [1 ]
Kacic, Zdravko [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, SLO-2000 Maribor, Slovenia
关键词
D O I
10.1155/2007/64102
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper concerns the problem of automatic speech recognition in noise-intense and adverse environments. The main goal of the proposed work is the definition, implementation, and evaluation of a novel noise robust speech signal parameterization algorithm. The proposed procedure is based on time-frequency speech signal representation using wavelet packet decomposition. A new modified soft thresholding algorithm based on time-frequency adaptive threshold determination was developed to efficiently reduce the level of additive noise in the input noisy speech signal. A two-stage Gaussian mixture model (GMM)-based classifier was developed to perform speech/nonspeech as well as voiced/unvoiced classification. The adaptive topology of the wavelet packet decomposition tree based on voiced/unvoiced detection was introduced to separately analyze voiced and unvoiced segments of the speech signal. The main feature vector consists of a combination of log-root compressed wavelet packet parameters, and autoregressive parameters. The final output feature vector is produced using a two-staged feature vector postprocessing procedure. In the experimental framework, the noisy speech databases Aurora 2 and Aurora 3 were applied together with corresponding standardized acoustical model training/testing procedures. The automatic speech recognition performance achieved using the proposed noise robust speech parameterization procedure was compared to the standardized mel-frequency cepstral coefficient ( MFCC) feature extraction procedures ETSI ES 201 108 and ETSI ES 202 050. Copyright (C) 2007 B. Kotnik and Z. Kacic.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A Comprehensive Noise Robust Speech Parameterization Algorithm Using Wavelet Packet Decomposition-Based Denoising and Speech Feature Representation Techniques
    Bojan Kotnik
    Zdravko Kačič
    [J]. EURASIP Journal on Advances in Signal Processing, 2007
  • [2] A noise robust feature extraction algorithm using joint wavelet packet subband decomposition and AR modeling of speech signals
    Kotnik, Bojan
    Kacic, Zdravko
    [J]. SIGNAL PROCESSING, 2007, 87 (06) : 1202 - 1223
  • [3] Speech Denoising Using Discrete Wavelet Packet Decomposition Technique
    Oktar, Mehmet Alper
    Nibouche, Mokhtar
    Baltaci, Yusuf
    [J]. 2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 817 - 820
  • [4] Noise robust speech parameterization using multiresolution feature extraction
    Hariharan, R
    Kiss, I
    Viikki, O
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08): : 856 - 865
  • [5] Noise suppression based on wavelet packet decomposition and quantile noise estimation for robust automatic speech recognition
    Rank, Erhard
    Van Pham, Tuan
    Kubin, Gernot
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 477 - 480
  • [6] Wavelet-based denoising for robust feature extraction for speech recognition
    Farooq, O
    Datta, S
    [J]. ELECTRONICS LETTERS, 2003, 39 (01) : 163 - 165
  • [7] Robust Feature Extracting of Speech Signal Based on Wavelet Packet Transform
    Han Zhiyan
    Wang Jian
    Lun Shuxian
    Wang Xu
    [J]. PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 2832 - 2837
  • [8] Research on Key Parameters of Speech Denoising Algorithm Based on Wavelet Packet Transform
    Du, Ligang
    Xu, Ru
    Xu, Fang
    Wang, Deqing
    Chen, Huabin
    [J]. PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 6, 2010, : 551 - 556
  • [9] Denoising Speech Based on Deep Learning and Wavelet Decomposition
    Wang, Li
    Zheng, Weiguang
    Ma, Xiaojun
    Lin, Shiming
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [10] Speech Denoising Based on Sparse Representation Algorithm
    Zhou, Yan
    Zhao, Heming
    Chen, Xueqin
    Liu, Tao
    Wu, Di
    Shang, Li
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT II, 2016, 9772 : 202 - 211