Speech frame recognition based on less shift sensitive wavelet filter banks

被引:0
|
作者
Hamid Reza Tohidypour
Amin Banitalebi-Dehkordi
机构
[1] University of British Columbia,Digital Multimedia Lab, Department of Electrical and Computer Engineering
来源
关键词
Dual-tree complex wavelet transform (DT-CWT); Four-channel double-density discrete wavelet transform (FCDDDWT); Redundant wavelet filter bank (RWFB); Wavelet transform (WT); Speech frame recognition; Perceptual dual-tree complex wavelet filter bank;
D O I
暂无
中图分类号
学科分类号
摘要
The wavelet transform possesses multi-resolution property and high localization performance; hence, it can be optimized for speech recognition. In our previous work, we show that redundant wavelet filter bank parameters work better in speech recognition task, because they are much less shift sensitive than those of critically sampled discrete wavelet transform (DWT). In this paper, three types of wavelet representations are introduced, including features based on dual-tree complex wavelet transform (DT-CWT), perceptual dual-tree complex wavelet transform, and four-channel double-density discrete wavelet transform (FCDDDWT). Then, appropriate filter values for DT-CWT and FCDDDWT are proposed. The performances of the proposed wavelet representations are compared in a phoneme recognition task using special form of the time-delay neural networks. Performance evaluations confirm that dual-tree complex wavelet filter banks outperform conventional DWT in speech recognition systems. The proposed perceptual dual-tree complex wavelet filter bank results in up to approximately 9.82 % recognition rate increase, compared to the critically sampled two-channel wavelet filter bank.
引用
收藏
页码:633 / 637
页数:4
相关论文
共 50 条
  • [1] Speech frame recognition based on less shift sensitive wavelet filter banks
    Tohidypour, Hamid Reza
    Banitalebi-Dehkordi, Amin
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (04) : 633 - 637
  • [2] A new representation for speech frame recognition based on redundant wavelet filter banks
    Tohidypour, Hamid Reza
    Seyyedsalehi, Seyyed Ali
    Behbood, Hossein
    Roshandel, Hossein
    [J]. SPEECH COMMUNICATION, 2012, 54 (02) : 256 - 271
  • [3] Frame analysis of wavelet type filter banks
    Stanhill, D
    Zeevi, YY
    [J]. 1996 IEEE DIGITAL SIGNAL PROCESSING WORKSHOP, PROCEEDINGS, 1996, : 435 - 438
  • [4] Sparse Wavelet Decomposition and Filter Banks with CNN Deep Learning for Speech Recognition
    Dai, Jingzhao
    Zhang, Yaan
    Hou, Jintao
    Wang, Xiewen
    Tan, Lizhe
    Jiang, Jean
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY (EIT), 2019, : 98 - 103
  • [5] Speech recognition based on auditory wavelet packet filter
    Zhang, XY
    Jiao, ZP
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 695 - 698
  • [6] Frame analysis of wavelet-type filter banks
    Stanhill, D
    Zeevi, YY
    [J]. SIGNAL PROCESSING, 1998, 67 (02) : 125 - 139
  • [7] TEXTURE-BASED FINGERPRINT RECOGNITION COMBINING DIRECTIONAL FILTER BANKS AND WAVELET
    Li, Chaorong
    Fu, Bo
    Li, Jianping
    Yang, Xingchun
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (04)
  • [8] Wavelet based speech recognition
    Gamulkiewicz, B
    Weeks, M
    [J]. PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 678 - 681
  • [9] Robust speech recognition by selecting mel-filter banks
    Wu, Yun-Peng
    Mao, Jia-Min
    Li, Wei-Feng
    [J]. PROCEEDINGS OF THE 2ND ANNUAL INTERNATIONAL CONFERENCE ON ELECTRONICS, ELECTRICAL ENGINEERING AND INFORMATION SCIENCE (EEEIS 2016), 2016, 117 : 407 - 416
  • [10] On the evaluation of wavelet filter banks for wavelet-based image watermarking
    Miyazaki, A
    [J]. ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 877 - 882