Mel scaled M-band wavelet filter bank for speech recognition

被引:3
|
作者
Upadhyaya P. [1 ]
Farooq O. [1 ]
Abidi M.R. [1 ]
机构
[1] Department of Electronics Engineering, Aligarh Muslim University, Aligarh, Uttar Pradesh
关键词
Dyadic; Filter bank and feature extraction; M-band wavelet; MFCC;
D O I
10.1007/s10772-018-9545-2
中图分类号
学科分类号
摘要
A Mel scaled M-band wavelet filter bank structure is used to extract the robust acoustic feature for speech recognition application. The proposed filter bank can provide flexibility of frequency partition that decomposes the speech signal into the M-frequency band. To estimate the difference between Mel scaled M-band wavelet and dyadic wavelet filter bank, relative bandwidth deviation (RBD) and root mean square bandwidth deviation (RMSBD) with respect to baseline (Mel filter bank bandwidth) is calculated. Proposed filter bank gives 40.90 and 49.84% reduction for RBD and RMSBD respectively, over 24-dyadic wavelet filter bank. Feature extraction from the proposed filter bank using AMUAV corpus shows an improvement in terms of word recognition accuracy (WRA) at all SNR range (20 dB to 0 dB) over baseline (MFCC) features. For AMUAV corpus, the proposed feature shows the maximum improvement in WRA of 3.93% over baseline features and 3.90% over dyadic wavelet filter bank features. When applied to the VidTIMIT corpus, proposed features show the maximum improvement in WRA of 1.64% over baseline features and 4.43% over dyadic features. © 2018, Springer Science+Business Media, LLC, part of Springer Nature.
引用
收藏
页码:797 / 807
页数:10
相关论文
共 50 条
  • [1] Mel-scaled discrete wavelet coefficients for speech recognition
    Gowdy, JN
    Tufekci, Z
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1351 - 1354
  • [2] M-ARY WAVELET TRANSFORM AND FORMULATION FOR PERFECT RECONSTRUCTION IN M-BAND FILTER BANK
    YAOU, MH
    CHANG, WT
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1994, 42 (12) : 3508 - 3512
  • [3] Filter bank tree and M-band wavelet packet algorithms in audio signal processing
    Kurth, F
    Clausen, M
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1999, 47 (02) : 549 - 554
  • [4] Recognition of Subsampled Speech Using a Modified Mel Filter Bank
    Bhuvanagiri, Kiran Kumar
    Kopparapu, Sunil Kumar
    [J]. ADVANCES IN COMPUTING AND COMMUNICATIONS, PT 4, 2011, 193 : 293 - 299
  • [5] Recognition of subsampled speech using a modified Mel filter bank
    Kopparapu, Sunil Kumar
    Bhuvanagiri, Kiran Kumar
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (02) : 655 - 662
  • [6] M-Band Wavelet Transform in Face Recognition System
    Wong, Yee Wan
    Seng, Kah Phooi
    Ang, Li-Minn
    [J]. ECTI-CON 2008: PROCEEDINGS OF THE 2008 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, : 453 - 456
  • [7] A fast implementation of wavelet transform for m-band filter banks
    Tian, J
    Wells, RO
    [J]. WAVELET APPLICATIONS V, 1998, 3391 : 534 - 545
  • [8] NEW REGULAR M-BAND ORTHOGONAL WAVELET FILTER BANK DESIGN USING ZERO-INSERTION METHOD
    KWON, SK
    KIM, JK
    [J]. ELECTRONICS LETTERS, 1994, 30 (10) : 753 - 754
  • [9] Modified M-band synthesis filter bank for fractional scalability of images
    Pau, Gregoire
    Pesquet-Popescu, Beatrice
    Piella, Gemma
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (06) : 345 - 348
  • [10] Mel filter-like admissible wavelet packet structure for speech recognition
    Farooq, O
    Datta, S
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (07) : 196 - 198