Auditory-based wavelet packet filterbank for speech recognition using neural network

被引:12
|
作者
Gandhiraj, R. [1 ]
Sathidevi, P. S. [2 ]
机构
[1] Dr Mahalingam Coll Engg & Tech, ECE Dept, Pollachi, Tamil Nadu, India
[2] Nat Inst Technol Calicut, ECE Dept, Calicut, Kerala, India
关键词
auditory-based; speech recognition; wavelet packet; neural network;
D O I
10.1109/ADCOM.2007.104
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A major problem of most speech recognition systems is their unsatisfactory robustness in noise. Human inner ear based 'feature extraction' leads to very robust speech understanding in noise. This 'Model of Auditory Periphery' is acting as front-end model of this speech recognition process. This paper describes two quantitative models for signal processing in auditory system (i) Gamma Tone Filter Bank (GTFB) and (ii) Wavelet Packet (WP) as front-ends for robust speech recognition. The auditory feature vectors had been used to train neural network. The classification of the feature vectors was done by the neural network using Back Propagation (BP) algorithm. The system performance was measured by recognition rate with various signal-to-noise ratios over -10 to 10 dB. The proposed system's performance was compared with various types of front-ends and recognition methods such as auditory features with Hidden Markov Model (HMM) & Layered Neural Network (LRNN), auditory features with Mel Frequency Cepstral Coefficient (WFCC) & LRNN and vocal tract model: MFCC & HMM, Dynamic time warping (DTW). The performances of proposed models with gamma tone filter bank and wavelet packet as front-ends were also compared. It had been identified that proposed system with wavelet packet as front-end and Back Propagation Neural Network (BPNN) as the recognition method is having good recognition rate over -10 to 10 dB. Both speaker independent and speaker dependent recognition systems had been designed, implemented and tested.
引用
收藏
页码:666 / +
页数:2
相关论文
共 50 条
  • [41] Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition
    Gomez, Randy
    Kawahara, Tatsuya
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1242 - 1245
  • [42] Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients
    Huang, Yongming
    Wu, Ao
    Zhang, Guobao
    Li, Yue
    [J]. PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 436 - 443
  • [43] Auditory-based acoustic distinctive features and spectral cues for automatic speech recognition using a multi-stream paradigm
    Tolba, H
    Selouani, SA
    O'Shaughnessy, D
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 837 - 840
  • [44] Face Recognition Based on Wavelet Neural Network
    Zhang, Hong
    [J]. ADVANCED RESEARCH ON INDUSTRY, INFORMATION SYSTEMS AND MATERIAL ENGINEERING, PTS 1-7, 2011, 204-210 : 216 - 219
  • [45] An auditory-based distortion measure with application to concatenative speech synthesis
    Hansen, JHL
    Chappell, DT
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): : 489 - 495
  • [46] Automatic Recognition of Retinopathy Diseases by Using Wavelet Based Neural Network
    Yagmur, Fatma Demirezen
    Karlik, Bekir
    Okatan, Ali
    [J]. 2008 FIRST INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES, VOLS 1 AND 2, 2008, : 461 - +
  • [47] Shape-based target recognition using wavelet neural network
    Pan, Hong
    Xia, Liangzheng
    [J]. Shuju Caiji Yu Chuli/Journal of Data Acquisition and Processing, 2008, 23 (01): : 27 - 34
  • [48] Monaural Auditory-Based Unvoiced Speech Segregation Using SNR-Based Subband Spectral Subtraction
    Geravanchizadeh, Masoud
    Dadvar, Paria
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2014, 100 (02) : 353 - 361
  • [49] A novel method for breast cancer prognosis using wavelet packet based neural network
    Jamarani, A. Sepehr M. H.
    Rezai-rad, B. Gholamali
    Behnam, C. Hamid
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 3414 - 3417
  • [50] Speech Recognition using Artificial Neural Network
    Gupta, Arpita
    Joshi, Akshay
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 68 - 71