A comprehensive noise robust speech parameterization algorithm using wavelet packet decomposition-based denoising and speech feature representation techniques

Cited by: 0

Authors:

Kotnik, Bojan [1]

Kacic, Zdravko [1]

Affiliations:

[1] Univ Maribor, Fac Elect Engn & Comp Sci, SLO-2000 Maribor, Slovenia

Source:

EURASIP Journal on Advances in Signal Processing | 2007, Vol. 2007, Issue 1

DOI:

10.1155/2007/64102

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

This paper concerns the problem of automatic speech recognition in noise-intense and adverse environments. The main goal of the proposed work is the definition, implementation, and evaluation of a novel noise robust speech signal parameterization algorithm. The proposed procedure is based on time-frequency speech signal representation using wavelet packet decomposition. A new modified soft thresholding algorithm based on time-frequency adaptive threshold determination was developed to efficiently reduce the level of additive noise in the input noisy speech signal. A two-stage Gaussian mixture model (GMM)-based classifier was developed to perform speech/nonspeech as well as voiced/unvoiced classification. An adaptive topology of the wavelet packet decomposition tree, based on voiced/unvoiced detection, was introduced to analyze voiced and unvoiced segments of the speech signal separately. The main feature vector consists of a combination of log-root compressed wavelet packet parameters and autoregressive parameters. The final output feature vector is produced using a two-stage feature vector postprocessing procedure. In the experimental framework, the noisy speech databases Aurora 2 and Aurora 3 were used together with the corresponding standardized acoustical model training/testing procedures. The automatic speech recognition performance achieved using the proposed noise robust speech parameterization procedure was compared to the standardized mel-frequency cepstral coefficient (MFCC) feature extraction procedures ETSI ES 201 108 and ETSI ES 202 050. Copyright (C) 2007 B. Kotnik and Z. Kacic.
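The denoising idea mentioned in the abstract (soft thresholding of wavelet packet coefficients) can be illustrated with a minimal sketch. This is not the authors' method: it assumes a Haar filter, a full (non-adaptive) packet tree, and the generic universal threshold with a median-based noise estimate in place of the paper's modified, time-frequency adaptive threshold. All function names here are hypothetical.

```python
import numpy as np

def haar_step(x):
    """Split x (even length) into orthonormal Haar approximation and detail halves."""
    even, odd = x[0::2], x[1::2]
    return (even + odd) / np.sqrt(2.0), (even - odd) / np.sqrt(2.0)

def haar_inverse_step(a, d):
    """Invert haar_step: interleave the reconstructed even and odd samples."""
    x = np.empty(2 * len(a))
    x[0::2] = (a + d) / np.sqrt(2.0)
    x[1::2] = (a - d) / np.sqrt(2.0)
    return x

def wp_decompose(x, levels):
    """Full wavelet packet tree; returns the 2**levels leaf nodes in natural order."""
    nodes = [np.asarray(x, dtype=float)]
    for _ in range(levels):
        # Unlike the plain wavelet transform, every node is split again.
        nodes = [half for node in nodes for half in haar_step(node)]
    return nodes

def wp_reconstruct(nodes):
    """Merge sibling leaves pairwise until the original-length signal is recovered."""
    nodes = list(nodes)
    while len(nodes) > 1:
        nodes = [haar_inverse_step(nodes[i], nodes[i + 1])
                 for i in range(0, len(nodes), 2)]
    return nodes[0]

def soft_threshold(c, t):
    """Shrink coefficients toward zero by t; magnitudes below t become zero."""
    return np.sign(c) * np.maximum(np.abs(c) - t, 0.0)

def denoise(x, levels=3):
    """Assumed baseline: universal threshold, applied to all leaves except the lowest band.

    len(x) must be divisible by 2**levels.
    """
    x = np.asarray(x, dtype=float)
    _, d1 = haar_step(x)
    sigma = np.median(np.abs(d1)) / 0.6745       # robust noise level estimate
    t = sigma * np.sqrt(2.0 * np.log(len(x)))    # universal threshold
    nodes = wp_decompose(x, levels)
    nodes = [nodes[0]] + [soft_threshold(c, t) for c in nodes[1:]]
    return wp_reconstruct(nodes)
```

A single global threshold is the simplest possible choice; an adaptive scheme such as the one the abstract describes would instead vary the threshold across nodes and over time.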

Pages: 20


