Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition

Cited by: 0
Authors
Nehe, N. S. [1]
Holambe, R. S. [1]
Affiliations
[1] SGGS Inst Engn & Technol, Dept Instrumentat Engn, Nanded, MS, India
Keywords
Wavelet Packet Tree; Human Auditory Perception; Isolated Word Recognition
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper presents auditory-perception-based admissible Wavelet Packet Trees (WPTs) that partition the speech frequency range into bands following the Mel scale or the Bark scale. The proposed WPTs, selected using a Root Mean Square Error (RMSE) criterion, mimic the Mel or Bark scale, and hence the human auditory system, more accurately. Features obtained from the proposed WPTs are compared with Mel Frequency Cepstral Coefficients (MFCC). The algorithms are evaluated on the NIST TI-46 isolated-word database with a Hidden Markov Model (HMM) classifier. Experimental results show that the proposed features outperform MFCC and other wavelet features for Isolated Word Recognition (IWR).
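The RMSE-based tree selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' procedure, which the record does not give. It assumes a tree is described by the depths of its leaves listed in increasing frequency order (ignoring the Gray-code ordering of real wavelet packet nodes), uses the common Mel formula mel(f) = 2595 log10(1 + f/700), and defines the error as the RMSE between WPT band centers and the centers of equally spaced Mel bands. The function names, the 4 kHz Nyquist frequency, and the 8-band candidate trees are all hypothetical choices for illustration.

```python
import math
from fractions import Fraction


def hz_to_mel(f):
    """Common Mel-scale formula (an assumption; the paper may use another)."""
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def wpt_band_edges(leaf_depths, nyquist):
    """Band edges in Hz for leaves given in frequency order.

    Raises ValueError if the leaves do not form an admissible tree,
    i.e. a dyadic partition that covers [0, nyquist] exactly once.
    """
    pos = Fraction(0)
    edges = [0.0]
    for depth in leaf_depths:
        width = Fraction(1, 2 ** depth)
        # A leaf at this depth must start on a multiple of its own width.
        if (pos / width).denominator != 1:
            raise ValueError("leaf is not aligned to the dyadic grid")
        pos += width
        edges.append(float(pos) * nyquist)
    if pos != 1:
        raise ValueError("leaves do not cover the full band exactly")
    return edges


def mel_band_edges(n_bands, nyquist):
    """Edges of n_bands bands equally spaced on the Mel scale."""
    top = hz_to_mel(nyquist)
    return [mel_to_hz(top * k / n_bands) for k in range(n_bands + 1)]


def centers(edges):
    return [(lo + hi) / 2.0 for lo, hi in zip(edges, edges[1:])]


def tree_rmse(leaf_depths, nyquist):
    """RMSE (Hz) between WPT band centers and Mel-spaced band centers."""
    wpt = centers(wpt_band_edges(leaf_depths, nyquist))
    ref = centers(mel_band_edges(len(leaf_depths), nyquist))
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(wpt, ref)) / len(wpt))


if __name__ == "__main__":
    nyquist = 4000.0  # hypothetical value, not taken from the paper
    candidates = {
        "uniform": [3] * 8,
        "low-frequency refined": [4, 4, 4, 4, 3, 3, 2, 2],
    }
    for name, depths in candidates.items():
        print(name, round(tree_rmse(depths, nyquist), 1))
    best = min(candidates, key=lambda k: tree_rmse(candidates[k], nyquist))
    print("selected:", best)
```

Under these assumptions, the tree that splits low frequencies more finely yields the smaller error, which is the behavior an auditory-scale criterion is meant to reward. A Bark-scale variant would only swap the two conversion functions.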
Pages: 175-179
Number of pages: 5