A comprehensive noise robust speech parameterization algorithm using wavelet packet decomposition-based denoising and speech feature representation techniques

被引:0
|
作者
Kotnik, Bojan [1 ]
Kacic, Zdravko [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, SLO-2000 Maribor, Slovenia
关键词
D O I
10.1155/2007/64102
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper concerns the problem of automatic speech recognition in noise-intense and adverse environments. The main goal of the proposed work is the definition, implementation, and evaluation of a novel noise robust speech signal parameterization algorithm. The proposed procedure is based on time-frequency speech signal representation using wavelet packet decomposition. A new modified soft thresholding algorithm based on time-frequency adaptive threshold determination was developed to efficiently reduce the level of additive noise in the input noisy speech signal. A two-stage Gaussian mixture model (GMM)-based classifier was developed to perform speech/nonspeech as well as voiced/unvoiced classification. The adaptive topology of the wavelet packet decomposition tree based on voiced/unvoiced detection was introduced to separately analyze voiced and unvoiced segments of the speech signal. The main feature vector consists of a combination of log-root compressed wavelet packet parameters, and autoregressive parameters. The final output feature vector is produced using a two-staged feature vector postprocessing procedure. In the experimental framework, the noisy speech databases Aurora 2 and Aurora 3 were applied together with corresponding standardized acoustical model training/testing procedures. The automatic speech recognition performance achieved using the proposed noise robust speech parameterization procedure was compared to the standardized mel-frequency cepstral coefficient ( MFCC) feature extraction procedures ETSI ES 201 108 and ETSI ES 202 050. Copyright (C) 2007 B. Kotnik and Z. Kacic.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] FEATURE EXTRACTION ALGORITHM USING NEW CEPSTRAL TECHNIQUES FOR ROBUST SPEECH RECOGNITION
    Korba, Mohamed Cherif Amara
    Bourouba, Houcine
    Djemili, Rafik
    [J]. MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2020, 33 (02) : 90 - 101
  • [22] A new speech enhancement algorithm using wavelet packet transform
    Guo, Jichang
    Wang, Wenliang
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 504 - 506
  • [23] Speech Denoising Method Based on Connecting Spectral Subtraction Combined with Wavelet Packet
    Li, Yue-sheng
    Lv, Cheng-guo
    Li, Sheng-nan
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (ICCSAI 2014), 2015, : 64 - 66
  • [24] Noise-robust speech feature processing with empirical mode decomposition
    Kuo-Hau Wu
    Chia-Ping Chen
    Bing-Feng Yeh
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2011
  • [25] Noise-robust speech feature processing with empirical mode decomposition
    Wu, Kuo-Hau
    Chen, Chia-Ping
    Yeh, Bing-Feng
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 9
  • [26] Speech Enhancement Using the Combination of Adaptive Wavelet Threshold and Spectral Subtraction based on Wavelet Packet Decomposition
    Li Ruwei
    Bao Changchun
    Xia Bingyin
    Jia Maoshen
    [J]. PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 481 - 484
  • [27] Speech enhancement using voiced speech probability based wavelet decomposition
    Bhowmick, Anirban
    Chandra, Mahesh
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2017, 62 : 706 - 718
  • [28] A robust speech feature - Perceptive Scalogram based on wavelet analysis
    Yao, KS
    Cao, ZG
    [J]. ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 662 - 665
  • [29] Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition
    Gomez, Randy
    Kawahara, Tatsuya
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1242 - 1245
  • [30] Wavelet packet decomposition-based fuzzy clustering algorithm for gene expression data
    Cui, Guangzhao
    Cao, Xianghong
    Wang, Yanfeng
    Cao, Lingzhi
    Huang, Buyi
    Yang, Cunxiang
    [J]. 2006 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, 2006, : 1027 - +