Articulation based admissible wavelet packet feature based on human cochlear frequency response for TIMIT speech recognition

被引:1
|
作者
Biswas, Astik [1 ]
Sahu, P. K. [1 ]
Bhowmick, Anirban [2 ]
Chandra, Mahesh [2 ]
机构
[1] Natl Inst Technol, Dept Elect Engn, Rourkela, India
[2] Birla Inst Technol, Dept Elect & Commun, Ranchi, Bihar, India
关键词
Speech recognition; Wavelet packets; ERB scale; HMM; Phoneme recognition;
D O I
10.1016/j.asej.2014.07.006
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
To deal with non-stationary and quasi-stationary signals, wavelet transform has been used as an effective tool for the time-frequency analysis. In the recent years, wavelet transform has been used extensively for feature extraction in noisy speech recognition. These filters have the benefit of having frequency bands spacing similar to the auditory Equivalent Rectangular Bandwidth (ERB) scale. Central frequencies of ERB are equally distributed with the frequency response of the human cochlea. This paper deals with the speaker-independent Automatic Speech Recognition (ASR) system for continuous speech. This Hidden Markov Model (HMM) based ASR system was developed for English using recordings of four regions taken from TIMIT database. A new set of features were derived using wavelet packet transform's multi-resolution capabilities and having an advantage of ERB filter based on the human cochlea. New set of wavelet features have shown significant improvements in the noisy environment, especially at low SNR values. (C) 2014 Production and hosting by Elsevier B.V. on behalf of Ain Shams University.
引用
收藏
页码:1189 / 1198
页数:10
相关论文
共 50 条
  • [1] Admissible wavelet packet features based on human inner ear frequency response for Hindi consonant recognition
    Biswas, Astik
    Sahu, P. K.
    Chandra, Mahesh
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (04) : 1111 - 1122
  • [2] Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition
    Nehe, N. S.
    Holambe, R. S.
    [J]. IEEE REGION 10 COLLOQUIUM AND THIRD INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, VOLS 1 AND 2, 2008, : 175 - 179
  • [3] Auditory ERB like admissible wavelet packet features for TIMIT phoneme recognition
    Sahu, P. K.
    Biswas, Astik
    Bhowmick, Anirban
    Chandra, Mahesh
    [J]. ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2014, 17 (03): : 145 - 151
  • [4] Robust features for speech recognition based on admissible wavelet packets
    Farooq, O
    Datta, S
    [J]. ELECTRONICS LETTERS, 2001, 37 (25) : 1554 - 1556
  • [5] Speech recognition based on auditory wavelet packet filter
    Zhang, XY
    Jiao, ZP
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 695 - 698
  • [6] Adaptive Wavelet Packet Filter-Bank Based Acoustic Feature for Speech Emotion Recognition
    Li, Yue
    Zhang, Guobao
    Huang, Yongming
    [J]. PROCEEDINGS OF 2013 CHINESE INTELLIGENT AUTOMATION CONFERENCE: INTELLIGENT INFORMATION PROCESSING, 2013, 256 : 359 - 366
  • [7] Mel filter-like admissible wavelet packet structure for speech recognition
    Farooq, O
    Datta, S
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (07) : 196 - 198
  • [8] Speech recognition with emphasis on wavelet based feature extraction
    Farooq, O
    Datta, S
    [J]. IETE JOURNAL OF RESEARCH, 2002, 48 (01) : 3 - 13
  • [9] Speech Emotion Recognition Based on Wavelet Packet Coefficient Model
    Wang, Kunxia
    An, Ning
    Li, Lian
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 478 - 482
  • [10] A new feature in speech recognition based on wavelet transform
    Hao, Y
    Zhu, XY
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1526 - 1529