Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition

被引:32
|
作者
Pavez, Eduardo [1 ]
Silva, Jorge F. [1 ]
机构
[1] Univ Chile, Dept Elect Engn, Santiago 4123, Chile
关键词
Wavelet Packets; Filter-bank analysis; Automatic speech recognition; Filter-bank selection; Cepstral coefficients; The Gray code; SAMPLING THEOREM; MARKOV-MODELS; SIGNAL; REPRESENTATIONS; FILTERS;
D O I
10.1016/j.specom.2012.02.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work proposes using Wavelet-Packet Cepstral coefficients (WPPCs) as an alternative way to do filter-bank energy-based feature extraction (FE) for automatic speech recognition (ASR). The rich coverage of time-frequency properties of Wavelet Packets (WPs) is used to obtain new sets of acoustic features, in which competitive and better performances are obtained with respect to the widely adopted Mel-Frequency Cepstral coefficients (MFCCs) in the TIMIT corpus. In the analysis, concrete filter-bank design considerations are stipulated to obtain most of the phone-discriminating information embedded in the speech signal, where the filter-bank frequency selectivity, and better discrimination in the lower frequency range [200 Hz-1 kHz] of the acoustic spectrum are important aspects to consider. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:814 / 835
页数:22
相关论文
共 50 条
  • [1] Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients
    Huang, Yongming
    Wu, Ao
    Zhang, Guobao
    Li, Yue
    [J]. PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 436 - 443
  • [2] Speech emotion recognition based on deep belief networks and wavelet packet cepstral coefficients
    Huang Y.
    Wu A.
    Zhang G.
    Li Y.
    [J]. 1600, UK Simulation Society, Clifton Lane, Nottingham, NG11 8NS, United Kingdom (17): : 28.1 - 28.5
  • [3] Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition
    Adiga, Aniruddha
    Magimai-Doss, Mathew
    Seelamantula, Chandra Sekhar
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
  • [4] Wavelet packet cepstral analysis for speaker recognition
    Kinney, A
    Stevens, J
    [J]. THIRTY-SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS - CONFERENCE RECORD, VOLS 1 AND 2, CONFERENCE RECORD, 2002, : 206 - 209
  • [5] WAVELET BASED CEPSTRAL COEFFICIENTS FOR NEURAL NETWORK SPEECH RECOGNITION
    Adam, T. B.
    Salam, M. S.
    Gunawan, T. S.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (IEEE ICSIPA 2013), 2013, : 447 - 451
  • [6] Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients
    Sen, Tjong Wan
    Trilaksono, Bambang Riyanto
    Arman, Arry Akhmad
    Mandala, Rila
    [J]. JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2009, 3 (02) : 123 - 134
  • [7] Chip design of mel frequency cepstral coefficients for speech recognition
    Wang, JC
    Wang, JF
    Weng, YS
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3658 - 3661
  • [8] New wavelet packet model for automatic speech recognition system
    Karam, JR
    Phillips, WJ
    Robertson, W
    Artimy, MM
    [J]. CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING 2001, VOLS I AND II, CONFERENCE PROCEEDINGS, 2001, : 511 - 514
  • [9] Combining Mel Frequency Cepstral Coefficients and Fractal Dimensions for Automatic Speech Recognition
    Ezeiza, Aitzol
    Lopez de Ipina, Karmele
    Hernandez, Carmen
    Barroso, Nora
    [J]. ADVANCES IN NONLINEAR SPEECH PROCESSING, 2011, 7015 : 183 - +
  • [10] Feature Extraction Method Human Factor Cepstral Coefficients in Automatic Speech Recognition
    Rahali, Hajer
    Hajaiej, Zied
    Ellouze, Noureddine
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON COMMUNICATION SYSTEMS, NETWORKS & DIGITAL SIGNAL PROCESSING (CSNDSP), 2014, : 266 - 270