A new representation for speech frame recognition based on redundant wavelet filter banks

被引:8
|
作者
Tohidypour, Hamid Reza [1 ]
Seyyedsalehi, Seyyed Ali [1 ]
Behbood, Hossein [1 ]
Roshandel, Hossein [2 ]
机构
[1] Amirkabir Univ Technol, Dept Biomed Engn, Tehran 158754413, Iran
[2] Amirkabir Univ Technol, Dept Elect Engn, Tehran Polytech, Tehran 158754413, Iran
关键词
Redundant wavelet filter-bank (RWFB); Wavelet transform (WT); Speech frame recognition; Representation; Frame wavelet; Zero moments; Four-channel higher density discrete wavelet; Time delay neural network (TDNN); TRANSFORM;
D O I
10.1016/j.specom.2011.09.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Although the conventional wavelet transform possesses multi-resolution properties, it is not optimized for speech recognition systems. It suffers from lower performance compared with Mel Frequency Cepstral Coefficients (MFCCs) in which Mel scale is based on human auditory perception. In this paper, some new speech representations based on redundant wavelet filter-banks (RWFB) are proposed. RWFB parameters are much less shift-sensitive than those of critically sampled discrete wavelet transform (DWT), so they seem to feature better performance in speech recognition tasks because of having better time-frequency localization ability. However, the improvement is at the expense of higher redundancy. In this paper, some types of wavelet representations are introduced, including a combination of critically sampled DWT and some different multi-channel redundant filter-banks down-sampled by 2. In order to find appropriate filter values for multi-channel filter-banks, effects of changing the zero moments of proposed wavelet are discussed. The corresponding method performances are compared in a phoneme recognition task using time delay neural networks. It is revealed that redundant multi-channel wavelet filter-banks work better than conventional DWT in speech recognition systems. The proposed four-channel higher density discrete wavelet filter-bank results in up to approximately 8.95% recognition rate increase, compared with critically sampled two-channel wavelet filter-bank. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:256 / 271
页数:16
相关论文
共 50 条
  • [41] Mel filter-like admissible wavelet packet structure for speech recognition
    Farooq, O
    Datta, S
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (07) : 196 - 198
  • [42] Adaptive wavelet EMG compression based on local optimization of filter banks
    Paiva, Juliana Pereira Lisboa M.
    Kelencz, Carlos Alberto
    Paiva, Henrique Mohallem
    Galvao, Roberto Kawakami H.
    Magini, Marcio
    [J]. PHYSIOLOGICAL MEASUREMENT, 2008, 29 (07) : 843 - 856
  • [43] Redundant discrete wavelet transforms based moving object recognition and tracking
    Gao Tao
    Liu Zhengguang
    Zhang Jun
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2009, 20 (05) : 1115 - 1123
  • [45] Redundant discrete wavelet transforms based moving object recognition and tracking
    School of Electrical Engineering and Automation, Tianjin Univ., Tianjin 300072, China
    [J]. J Syst Eng Electron, 2009, 5 (1115-1123):
  • [46] A signal processing method based on wavelet filter banks for vortex flowmeter
    Hefei University of Technology, Hefei 230009, China
    [J]. Jiliang Xuebao, 2006, 2 (133-136):
  • [47] Directional filter banks for wavelet decomposition of images based on the radon transform
    von Borries, R. E.
    Miosso, C. Jacques
    Potes, C. M.
    [J]. CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 2095 - 2099
  • [48] Palmprint Identification Based on Non-separable Wavelet Filter Banks
    Wu, Jie
    You, Xinge
    Tang, Yuan Yan
    Cheung, Yiu-ming
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2917 - 2920
  • [49] Invariant pattern recognition filter based on the Wavelet transform
    Zalevsky, Z
    Mendlovic, D
    Ferreira, C
    [J]. SECOND IBEROAMERICAN MEETING ON OPTICS, 1996, 2730 : 275 - 283
  • [50] Design of Nearly-Orthogonal Symmetric Wavelet Filter Banks Based on the Wavelet Orthogonalization Process
    Fabrício Ely Gossler
    Marco Aparecido Queiroz Duarte
    Francisco Villarreal
    [J]. Circuits, Systems, and Signal Processing, 2023, 42 : 234 - 254