Auditory-based wavelet packet filterbank for speech recognition using neural network

被引:12
|
作者
Gandhiraj, R. [1 ]
Sathidevi, P. S. [2 ]
机构
[1] Dr Mahalingam Coll Engg & Tech, ECE Dept, Pollachi, Tamil Nadu, India
[2] Nat Inst Technol Calicut, ECE Dept, Calicut, Kerala, India
关键词
auditory-based; speech recognition; wavelet packet; neural network;
D O I
10.1109/ADCOM.2007.104
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A major problem of most speech recognition systems is their unsatisfactory robustness in noise. Human inner ear based 'feature extraction' leads to very robust speech understanding in noise. This 'Model of Auditory Periphery' is acting as front-end model of this speech recognition process. This paper describes two quantitative models for signal processing in auditory system (i) Gamma Tone Filter Bank (GTFB) and (ii) Wavelet Packet (WP) as front-ends for robust speech recognition. The auditory feature vectors had been used to train neural network. The classification of the feature vectors was done by the neural network using Back Propagation (BP) algorithm. The system performance was measured by recognition rate with various signal-to-noise ratios over -10 to 10 dB. The proposed system's performance was compared with various types of front-ends and recognition methods such as auditory features with Hidden Markov Model (HMM) & Layered Neural Network (LRNN), auditory features with Mel Frequency Cepstral Coefficient (WFCC) & LRNN and vocal tract model: MFCC & HMM, Dynamic time warping (DTW). The performances of proposed models with gamma tone filter bank and wavelet packet as front-ends were also compared. It had been identified that proposed system with wavelet packet as front-end and Back Propagation Neural Network (BPNN) as the recognition method is having good recognition rate over -10 to 10 dB. Both speaker independent and speaker dependent recognition systems had been designed, implemented and tested.
引用
收藏
页码:666 / +
页数:2
相关论文
共 50 条
  • [1] Speech recognition based on auditory wavelet packet filter
    Zhang, XY
    Jiao, ZP
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 695 - 698
  • [2] AN AUDITORY-BASED FEATURE FOR ROBUST SPEECH RECOGNITION
    Shao, Yang
    Jin, Zhaozhang
    Wang, DeLiang
    Srinivasan, Soundararajan
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4625 - +
  • [3] Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition
    Nehe, N. S.
    Holambe, R. S.
    [J]. IEEE REGION 10 COLLOQUIUM AND THIRD INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, VOLS 1 AND 2, 2008, : 175 - 179
  • [4] Discriminative auditory-based features for robust speech recognition
    Mak, BKW
    Tam, YC
    Li, PQ
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (01): : 27 - 36
  • [5] Speech Enhancement Using Auditory-Based Transform
    Tank, Vanita Raj
    Mahajan, S. P.
    Khaparde, Arti
    Deshpande, Rahul
    [J]. 2015 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2015,
  • [6] Auditory filterbank denoising neural network for speech enhancement in wearable auditory device
    Kim, Seon Man
    [J]. ELECTRONICS LETTERS, 2024, 60 (10)
  • [7] Speech recognition using a wavelet packet adaptive network based fuzzy inference system
    Avci, Engin
    Akpolat, Zuhtu Hakan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2006, 31 (03) : 495 - 503
  • [8] Speech Emotion Recognition Using Multichannel Parallel Convolutional Recurrent Neural Networks based on Gammatone Auditory Filterbank
    Peng, Zhichao
    Zhu, Zhi
    Unoki, Masashi
    Dang, Jianwu
    Akagi, Masato
    [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1750 - 1755
  • [9] Ultrasonic Target Recognition Based on Wavelet Packet and Neural Network
    Kou, Xueqin
    Gu, Lichen
    [J]. PROCEEDINGS OF THE 2012 EIGHTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2012), 2012, : 271 - 274
  • [10] Intelligent target recognition based on wavelet packet neural network
    Avci, E
    Turkoglu, I
    Poyraz, M
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2005, 29 (01) : 175 - 182