Auditory ERB like admissible wavelet packet features for TIMIT phoneme recognition

被引:9
|
作者
Sahu, P. K. [1 ]
Biswas, Astik [1 ]
Bhowmick, Anirban [2 ]
Chandra, Mahesh [2 ]
机构
[1] Natl Inst Technol, Dept Elect Engn, Rourkela, India
[2] Birla Inst Technol, Dept ECE, Ranchi, Bihar, India
关键词
Speech recognition; Wavelet packets; ERB scale; WERBC; WMFCC; Phoneme recognition;
D O I
10.1016/j.jestch.2014.04.004
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In recent years wavelet transform has been found to be an effective tool for timeefrequency analysis. Wavelet transform has been used as feature extraction in speech recognition applications and it has proved to be an effective technique for unvoiced phoneme classification. In this paper a new filter structure using admissible wavelet packet is analyzed for English phoneme recognition. These filters have the benefit of having frequency bands spacing similar to the auditory Equivalent Rectangular Bandwidth (ERB) scale. Central frequencies of ERB scale are equally distributed along the frequency response of human cochlea. A new sets of features are derived using wavelet packet transform's multi-resolution capabilities and found to be better than conventional features for unvoiced phoneme problems. Some of the noises from NOISEX-92 database has been used for preparing the artificial noisy database to test the robustness of wavelet based features. Copyright (C) 2014, Karabuk University. Production and hosting by Elsevier B.V. All rights reserved.
引用
收藏
页码:145 / 151
页数:7
相关论文
共 32 条
  • [1] Feature extraction technique using ERB like wavelet sub-band periodic and aperiodic decomposition for TIMIT phoneme recognition
    Biswas, Astik
    Sahu, P.
    Bhowmick, Anirban
    Chandra, Mahesh
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (04) : 389 - 399
  • [2] Admissible wavelet packet sub-band-based harmonic energy features for Hindi phoneme recognition
    Biswas, Astik
    Sahu, Prasanna Kumar
    Bhowmick, Anirban
    Chandra, Mahesh
    [J]. IET SIGNAL PROCESSING, 2015, 9 (06) : 511 - 519
  • [3] 16-band filter derived by admissible wavelet packet for phoneme recognition
    Farooq, O
    Datta, S
    [J]. PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2002, : 192 - 195
  • [4] Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition
    Nehe, N. S.
    Holambe, R. S.
    [J]. IEEE REGION 10 COLLOQUIUM AND THIRD INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, VOLS 1 AND 2, 2008, : 175 - 179
  • [5] Articulation based admissible wavelet packet feature based on human cochlear frequency response for TIMIT speech recognition
    Biswas, Astik
    Sahu, P. K.
    Bhowmick, Anirban
    Chandra, Mahesh
    [J]. AIN SHAMS ENGINEERING JOURNAL, 2014, 5 (04) : 1189 - 1198
  • [6] Speech Recognition using ERB-like Admissible Wavelet Packet Decomposition based on Perceptual sub-band Weighting
    Biswas, Astik
    Sahu, P. K.
    Bhowmick, Anirban
    Chandra, Mahesh
    [J]. IETE JOURNAL OF RESEARCH, 2016, 62 (02) : 129 - 139
  • [7] Admissible wavelet packet sub-band based harmonic energy features using ANOVA fusion techniques for Hindi phoneme recognition
    Biswas, Astik
    Sahu, P. K.
    Chandra, Mahesh
    [J]. IET SIGNAL PROCESSING, 2016, 10 (08) : 902 - 911
  • [8] Mel filter-like admissible wavelet packet structure for speech recognition
    Farooq, O
    Datta, S
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (07) : 196 - 198
  • [9] Phoneme recognition using wavelet based features
    Farooq, O
    Datta, S
    [J]. INFORMATION SCIENCES, 2003, 150 (1-2) : 5 - 15
  • [10] Admissible wavelet packet features based on human inner ear frequency response for Hindi consonant recognition
    Biswas, Astik
    Sahu, P. K.
    Chandra, Mahesh
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (04) : 1111 - 1122