Phase Autocorrelation Bark Wavelet Transform (PACWT) Features for Robust Speech Recognition

被引:4
|
作者
Majeed, Sayf A. [1 ]
Husain, Hafizah [1 ]
Samad, Salina A. [1 ]
机构
[1] UKM, Natl Univ Malaysia, Fac Engn & Built Environm, Dept Elect Elect & Syst Engn, Bangi, Selangor, Malaysia
关键词
speech recognition; feature extraction; phase autocorrelation; wavelet transform; NOISE;
D O I
10.1515/aoa-2015-0004
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a new feature-extraction method is proposed to achieve robustness of speech recognition systems. This method combines the benefits of phase autocorrelation (PAC) with bark wavelet transform. PAC uses the angle to measure correlation instead of the traditional autocorrelation measure, whereas the bark wavelet transform is a special type of wavelet transform that is particularly designed for speech signals. The extracted features from this combined method are called phase autocorrelation bark wavelet transform (PACWT) features. The speech recognition performance of the PACWT features is evaluated and compared to the conventional feature extraction method mel frequency cepstrum coefficients (MFCC) using TI-Digits database under different types of noise and noise levels. This database has been divided into male and female data. The result shows that the word recognition rate using the PACWT features for noisy male data (white noise at 0 dB SNR) is 60%, whereas it is 41.35% for the MFCC features under identical conditions.
引用
收藏
页码:25 / 31
页数:7
相关论文
共 50 条
  • [21] Subband Feature Statistics Normalization Techniques Based on a Discrete Wavelet Transform for Robust Speech Recognition
    Hung, Jeih-weih
    Fan, Hao-Teng
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (09) : 806 - 809
  • [22] Phase Auto Correlation (PAC) features for noise robust speech recognition
    Ikbal, Shajith
    Misra, Hemant
    Hermansky, Hynek
    Magimai-Doss, Mathew
    [J]. SPEECH COMMUNICATION, 2012, 54 (07) : 867 - 880
  • [23] Pitch detection of noisy speech signal based on bark wavelet transform
    Zhao, HM
    Zhu, Q
    Yu, YB
    Chen, XQ
    [J]. 2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 418 - 421
  • [24] Continuous Wavelet Transform based Speech Emotion Recognition
    Shegokar, Pankaj
    Sircar, Pradip
    [J]. 2016 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2016,
  • [25] A new feature in speech recognition based on wavelet transform
    Hao, Y
    Zhu, XY
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1526 - 1529
  • [26] Fractional Fourier transform features for speech recognition
    Sarikaya, R
    Gao, YQ
    Saon, G
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 529 - 532
  • [27] The Application of Adaptively Enhanced Bark Wavelet MFCC and The Introduction of A Novel Noise-robust Speech Recognition System
    Zheng Yu-Zheng
    Li Go-Liang
    Zhang Jie
    Liu Xiao-ying
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL WORKSHOP ON INFORMATION SECURITY AND APPLICATION, 2009, : 168 - 172
  • [28] Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition
    Adiga, Aniruddha
    Magimai-Doss, Mathew
    Seelamantula, Chandra Sekhar
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
  • [29] Bark-wavelet Analysis and HilberteHuang Transform for Underwater Target Recognition
    Zeng, Xiang-yang
    Wang, Shu-guang
    [J]. DEFENCE TECHNOLOGY, 2013, 9 (02): : 115 - 120
  • [30] Robust features for noisy speech recognition based on temporal trajectory filtering of short-time autocorrelation sequences
    Yuo, KH
    Wang, HC
    [J]. SPEECH COMMUNICATION, 1999, 28 (01) : 13 - 24