Auditory-based wavelet packet filterbank for speech recognition using neural network

Cited by: 12
Authors
Gandhiraj, R. [1 ]
Sathidevi, P. S. [2 ]
Affiliations
[1] Dr Mahalingam Coll Engg & Tech, ECE Dept, Pollachi, Tamil Nadu, India
[2] Nat Inst Technol Calicut, ECE Dept, Calicut, Kerala, India
Keywords
auditory-based; speech recognition; wavelet packet; neural network;
DOI
10.1109/ADCOM.2007.104
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
A major problem of most speech recognition systems is their unsatisfactory robustness in noise. 'Feature extraction' based on the human inner ear leads to very robust speech understanding in noise. This 'Model of Auditory Periphery' acts as the front-end model of the speech recognition process. This paper describes two quantitative models of signal processing in the auditory system, (i) a Gamma Tone Filter Bank (GTFB) and (ii) a Wavelet Packet (WP), as front-ends for robust speech recognition. The auditory feature vectors were used to train a neural network, which classified them using the Back Propagation (BP) algorithm. System performance was measured by recognition rate at signal-to-noise ratios from -10 to 10 dB. The proposed system was compared with various front-ends and recognition methods: auditory features with Hidden Markov Model (HMM) & Layered Neural Network (LRNN), auditory features with Mel Frequency Cepstral Coefficient (MFCC) & LRNN, and the vocal tract model: MFCC & HMM, Dynamic time warping (DTW). The proposed models with the gamma tone filter bank and the wavelet packet as front-ends were also compared with each other. The proposed system with the wavelet packet as front-end and a Back Propagation Neural Network (BPNN) as the recognition method was found to have a good recognition rate over -10 to 10 dB. Both speaker-independent and speaker-dependent recognition systems were designed, implemented and tested.
Pages: 666 / +
Number of pages: 2
Related papers
50 in total
  • [21] On the relevance of auditory-based Gabor features for deep learning in robust speech recognition
    Martinez, Angel Mario Castro
    Mallidi, Sri Harish
    Meyer, Bernd T.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 21 - 38
  • [22] Speech Emotion Recognition Based on Wavelet Packet Coefficient Model
    Wang, Kunxia
    An, Ning
    Li, Lian
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 478 - 482
  • [23] The Novel Recognition Method with Optimal Wavelet Packet and LSTM based Recurrent Neural Network
    Li, Mingai
    Zhu, Wei
    Zhang, Meng
    Sun, Yanjun
    Wang, Zhe
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2017, : 584 - 589
  • [24] Robust classification of stop consonants using auditory-based speech processing
    Ali, AMA
    Van der Spiegel, J
    Mueller, P
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 81 - 84
  • [25] A Noise-Robust Speech Recognition System Based on Wavelet Neural Network
    Wang, Yiping
    Zhao, Zhefeng
    [J]. ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT III, 2011, 7004 : 392 - 397
  • [26] Speech Emotion Recognition Research Based on Wavelet Neural Network for Robot Pet
    Huang, Yongming
    Zhang, Guobao
    Xu, Xiaoli
    [J]. EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2009, 5755 : 993 - 1000
  • [27] Speech recognition based on cooperative particle swarm optimizer wavelet neural network
    Chen, Li-Wei
    Zhang, Ye
    [J]. 2007 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1-4, PROCEEDINGS, 2007, : 716 - 720
  • [28] Wavelet packet based features selection and fuzzy ARTMAP neural network classifier for speech classification
    Radfar, MH
    Faez, K
    Sayadiyan, A
    Mobini, N
    [J]. PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 620 - 624
  • [29] Enhancement of Speech by Using Undecimated Wavelet Packet-Perceptual Filterbank and MM_LSA Estimation
    Tasmaz, Haci
    Ercelebi, Ergun
    [J]. 2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 537 - 540
  • [30] Signal recognition based on wavelet and wavelet neural network
    Wu, YJ
    Shi, XZ
    Xu, M
    [J]. THEORETICAL ASPECTS OF NEURAL COMPUTATION: A MULTIDISCIPLINARY PERSPECTIVE, 1998, : 189 - 194