Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition

被引:0
|
作者
Nehe, N. S. [1 ]
Holambe, R. S. [1 ]
机构
[1] SGGS Inst Engn & Technol, Dept Instrumentat Engn, Nanded, MS, India
关键词
Wavelet Packet Tree; Human Auditory Perception; Isolated Word Recognition;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the use of auditory perception based admissible Wavelet Packet Tree (WPT) for partitioning of speech frequencies into different bands based on the Mel scale or the Bark Scale. The proposed WPTs selected using Root Mean Square Error (RMSE) criterion mimic the Mel scale or the Bark scale more accurately and hence the human auditory system. Performance of the features obtained from the proposed WPTs is compared with Mel Frequency Cepstral Coefficients (MFCC). The algorithms are evaluated using NIST TI-46 isolated-word database using Hidden Markov Model (HMM) as a classifier. Experimental results show that the performance of proposed features is better than MFCC and other Wavelet features for Isolated Word Recognition (IWR).
引用
收藏
页码:175 / 179
页数:5
相关论文
共 50 条
  • [1] Speech recognition based on auditory wavelet packet filter
    Zhang, XY
    Jiao, ZP
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 695 - 698
  • [2] Design of optimal wavelet packet trees based on auditory perception criterion
    Karmakar, Abhijit
    Kumar, Arun
    Patney, R. K.
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (04) : 240 - 243
  • [3] Auditory ERB like admissible wavelet packet features for TIMIT phoneme recognition
    Sahu, P. K.
    Biswas, Astik
    Bhowmick, Anirban
    Chandra, Mahesh
    [J]. ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2014, 17 (03): : 145 - 151
  • [4] Mel filter-like admissible wavelet packet structure for speech recognition
    Farooq, O
    Datta, S
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (07) : 196 - 198
  • [5] Auditory-based wavelet packet filterbank for speech recognition using neural network
    Gandhiraj, R.
    Sathidevi, P. S.
    [J]. ADCOM 2007: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2007, : 666 - +
  • [6] Articulation based admissible wavelet packet feature based on human cochlear frequency response for TIMIT speech recognition
    Biswas, Astik
    Sahu, P. K.
    Bhowmick, Anirban
    Chandra, Mahesh
    [J]. AIN SHAMS ENGINEERING JOURNAL, 2014, 5 (04) : 1189 - 1198
  • [7] Robust features for speech recognition based on admissible wavelet packets
    Farooq, O
    Datta, S
    [J]. ELECTRONICS LETTERS, 2001, 37 (25) : 1554 - 1556
  • [8] Speech Emotion Recognition Based on Wavelet Packet Coefficient Model
    Wang, Kunxia
    An, Ning
    Li, Lian
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 478 - 482
  • [9] Speech Recognition using ERB-like Admissible Wavelet Packet Decomposition based on Perceptual sub-band Weighting
    Biswas, Astik
    Sahu, P. K.
    Bhowmick, Anirban
    Chandra, Mahesh
    [J]. IETE JOURNAL OF RESEARCH, 2016, 62 (02) : 129 - 139
  • [10] AUDITORY-PERCEPTION AND SPEECH RECOGNITION
    LEBEDEV, VG
    ZAGORUIKO, NG
    [J]. SPEECH COMMUNICATION, 1985, 4 (1-3) : 97 - 103