Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition

Cited by: 32
Authors
Pavez, Eduardo [1]
Silva, Jorge F. [1]
Affiliations
[1] Univ Chile, Dept Elect Engn, Santiago 4123, Chile
Keywords
Wavelet Packets; Filter-bank analysis; Automatic speech recognition; Filter-bank selection; Cepstral coefficients; The Gray code; SAMPLING THEOREM; MARKOV-MODELS; SIGNAL; REPRESENTATIONS; FILTERS;
DOI
10.1016/j.specom.2012.02.002
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This work proposes using Wavelet-Packet Cepstral coefficients (WPPCs) as an alternative way to do filter-bank energy-based feature extraction (FE) for automatic speech recognition (ASR). The rich coverage of time-frequency properties of Wavelet Packets (WPs) is used to obtain new sets of acoustic features, in which competitive and better performances are obtained with respect to the widely adopted Mel-Frequency Cepstral coefficients (MFCCs) in the TIMIT corpus. In the analysis, concrete filter-bank design considerations are stipulated to obtain most of the phone-discriminating information embedded in the speech signal, where the filter-bank frequency selectivity, and better discrimination in the lower frequency range [200 Hz-1 kHz] of the acoustic spectrum are important aspects to consider. (C) 2012 Elsevier B.V. All rights reserved.
Pages: 814-835
Page count: 22
Related papers
50 in total
  • [11] An architecture for wavelet-packet based speech enhancement for hearing aids
    Trenas, MA
    López, J
    Zapata, EL
    Argüello, F
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 849 - 852
  • [12] Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition
    Gomez, Randy
    Kawahara, Tatsuya
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1242 - 1245
  • [13] The usage of wavelet packet transformation in automatic noisy speech recognition systems
    Kotnik, B
    Kacic, Z
    Horvat, B
    [J]. IEEE REGION 8 EUROCON 2003, VOL B, PROCEEDINGS: COMPUTER AS A TOOL, 2003, : 131 - 134
  • [14] Exploiting independent filter bandwidth of human factor cepstral coefficients in automatic speech recognition
    Skowronski, MD
    Harris, JG
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (03): : 1774 - 1780
  • [15] Damped Oscillator Cepstral Coefficients for Robust Speech Recognition
    Mitra, Vikramjit
    Franco, Horacio
    Graciarena, Martin
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 886 - 890
  • [16] Adaptive wavelet-packet analysis for audio coding purposes
    Reyes, NR
    Zurera, MR
    Ferreras, FL
    Amores, PJ
    [J]. SIGNAL PROCESSING, 2003, 83 (05) : 919 - 929
  • [17] Automatic speech recognition based on cepstral coefficients and a Mel-based discrete energy operator
    Tolba, H
    O'Shaughnessy, D
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 973 - 976
  • [18] Enhanced Automatic Speech Recognition System Based on Enhancing Power-Normalized Cepstral Coefficients
    Tamazin, Mohamed
    Gouda, Ahmed
    Khedr, Mohamed
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (10):
  • [19] Wavelet and wavelet-packet analysis of Lamb wave signatures in laser ultrasonics
    Kercel, SW
    Klein, MB
    Pouet, B
    [J]. WAVELET APPLICATIONS VII, 2000, 4056 : 308 - 317
  • [20] CEPSTRAL NOISE SUBTRACTION FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Rehr, Robert
    Gerkmann, Timo
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 375 - 378