Speech frame recognition based on less shift sensitive wavelet filter banks

被引:0
|
作者
Hamid Reza Tohidypour
Amin Banitalebi-Dehkordi
机构
[1] University of British Columbia,Digital Multimedia Lab, Department of Electrical and Computer Engineering
来源
关键词
Dual-tree complex wavelet transform (DT-CWT); Four-channel double-density discrete wavelet transform (FCDDDWT); Redundant wavelet filter bank (RWFB); Wavelet transform (WT); Speech frame recognition; Perceptual dual-tree complex wavelet filter bank;
D O I
暂无
中图分类号
学科分类号
摘要
The wavelet transform possesses multi-resolution property and high localization performance; hence, it can be optimized for speech recognition. In our previous work, we show that redundant wavelet filter bank parameters work better in speech recognition task, because they are much less shift sensitive than those of critically sampled discrete wavelet transform (DWT). In this paper, three types of wavelet representations are introduced, including features based on dual-tree complex wavelet transform (DT-CWT), perceptual dual-tree complex wavelet transform, and four-channel double-density discrete wavelet transform (FCDDDWT). Then, appropriate filter values for DT-CWT and FCDDDWT are proposed. The performances of the proposed wavelet representations are compared in a phoneme recognition task using special form of the time-delay neural networks. Performance evaluations confirm that dual-tree complex wavelet filter banks outperform conventional DWT in speech recognition systems. The proposed perceptual dual-tree complex wavelet filter bank results in up to approximately 9.82 % recognition rate increase, compared to the critically sampled two-channel wavelet filter bank.
引用
收藏
页码:633 / 637
页数:4
相关论文
共 50 条
  • [21] Design of optimal shift-invariant orthonormal wavelet filter banks via genetic algorithm
    Shark, LK
    Yu, CY
    [J]. SIGNAL PROCESSING, 2003, 83 (12) : 2579 - 2591
  • [22] Mel scaled M-band wavelet filter bank for speech recognition
    Upadhyaya P.
    Farooq O.
    Abidi M.R.
    [J]. International Journal of Speech Technology, 2018, 21 (4) : 797 - 807
  • [23] The speech recognition based on the bark wavelet and CZCPA features
    Zhang Xueying
    Bai Jing
    [J]. 2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 318 - +
  • [24] Speech recognition with emphasis on wavelet based feature extraction
    Farooq, O
    Datta, S
    [J]. IETE JOURNAL OF RESEARCH, 2002, 48 (01) : 3 - 13
  • [25] The speech recognition system based on bark wavelet MFCC
    Zhang, Xue-ying
    Bai, Jing
    Liang, Wu-zhou
    [J]. 2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 780 - +
  • [26] Continuous Wavelet Transform based Speech Emotion Recognition
    Shegokar, Pankaj
    Sircar, Pradip
    [J]. 2016 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2016,
  • [27] A new feature in speech recognition based on wavelet transform
    Hao, Y
    Zhu, XY
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1526 - 1529
  • [28] Automatic Speech Recognition System Based on Wavelet Analysis
    Ziolko, Mariusz
    Galka, Jakub
    Ziolko, Bartosz
    Jadczyk, Tomasz
    Skurzok, Dawid
    Wicijowski, Jan
    [J]. 2010 IEEE FOURTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2010), 2010, : 450 - 451
  • [29] Mel filter-like admissible wavelet packet structure for speech recognition
    Farooq, O
    Datta, S
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (07) : 196 - 198
  • [30] Adaptive wavelet EMG compression based on local optimization of filter banks
    Paiva, Juliana Pereira Lisboa M.
    Kelencz, Carlos Alberto
    Paiva, Henrique Mohallem
    Galvao, Roberto Kawakami H.
    Magini, Marcio
    [J]. PHYSIOLOGICAL MEASUREMENT, 2008, 29 (07) : 843 - 856