Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition

被引：32

作者：

Pavez, Eduardo ^{[1
]}

Silva, Jorge F. ^{[1
]}

机构：

[1] Univ Chile, Dept Elect Engn, Santiago 4123, Chile

来源：

SPEECH COMMUNICATION | 2012年 / 54卷 / 06期

关键词：

Wavelet Packets; Filter-bank analysis; Automatic speech recognition; Filter-bank selection; Cepstral coefficients; The Gray code; SAMPLING THEOREM; MARKOV-MODELS; SIGNAL; REPRESENTATIONS; FILTERS;

D O I：

10.1016/j.specom.2012.02.002

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This work proposes using Wavelet-Packet Cepstral coefficients (WPPCs) as an alternative way to do filter-bank energy-based feature extraction (FE) for automatic speech recognition (ASR). The rich coverage of time-frequency properties of Wavelet Packets (WPs) is used to obtain new sets of acoustic features, in which competitive and better performances are obtained with respect to the widely adopted Mel-Frequency Cepstral coefficients (MFCCs) in the TIMIT corpus. In the analysis, concrete filter-bank design considerations are stipulated to obtain most of the phone-discriminating information embedded in the speech signal, where the filter-bank frequency selectivity, and better discrimination in the lower frequency range [200 Hz-1 kHz] of the acoustic spectrum are important aspects to consider. (C) 2012 Elsevier B.V. All rights reserved.

引用

页码：814 / 835

页数：22

共 50 条

[1] Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients
Huang, Yongming
Wu, Ao
Zhang, Guobao
Li, Yue
[J]. PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 436 - 443
[2] Speech emotion recognition based on deep belief networks and wavelet packet cepstral coefficients
Huang Y.
Wu A.
Zhang G.
Li Y.
[J]. 1600, UK Simulation Society, Clifton Lane, Nottingham, NG11 8NS, United Kingdom (17): : 28.1 - 28.5
[3] Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition
Adiga, Aniruddha
Magimai-Doss, Mathew
Seelamantula, Chandra Sekhar
[J]. 2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
[4] Wavelet packet cepstral analysis for speaker recognition
Kinney, A
Stevens, J
[J]. THIRTY-SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS - CONFERENCE RECORD, VOLS 1 AND 2, CONFERENCE RECORD, 2002, : 206 - 209
[5] WAVELET BASED CEPSTRAL COEFFICIENTS FOR NEURAL NETWORK SPEECH RECOGNITION
Adam, T. B.
Salam, M. S.
Gunawan, T. S.
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (IEEE ICSIPA 2013), 2013, : 447 - 451
[6] Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients
Sen, Tjong Wan
Trilaksono, Bambang Riyanto
Arman, Arry Akhmad
Mandala, Rila
[J]. JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2009, 3 (02) : 123 - 134
[7] Chip design of mel frequency cepstral coefficients for speech recognition
Wang, JC
Wang, JF
Weng, YS
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3658 - 3661
[8] New wavelet packet model for automatic speech recognition system
Karam, JR
Phillips, WJ
Robertson, W
Artimy, MM
[J]. CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING 2001, VOLS I AND II, CONFERENCE PROCEEDINGS, 2001, : 511 - 514
[9] Combining Mel Frequency Cepstral Coefficients and Fractal Dimensions for Automatic Speech Recognition
Ezeiza, Aitzol
Lopez de Ipina, Karmele
Hernandez, Carmen
Barroso, Nora
[J]. ADVANCES IN NONLINEAR SPEECH PROCESSING, 2011, 7015 : 183 - +
[10] Feature Extraction Method Human Factor Cepstral Coefficients in Automatic Speech Recognition
Rahali, Hajer
Hajaiej, Zied
Ellouze, Noureddine
[J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON COMMUNICATION SYSTEMS, NETWORKS & DIGITAL SIGNAL PROCESSING (CSNDSP), 2014, : 266 - 270

← 1 2 3 4 5 →