Frequency and wavelet filtering for robust speech recognition

被引:0
|
作者
Deviren, M [1 ]
Daoudi, K [1 ]
机构
[1] INRIA, LORIA, Speech Grp, F-54602 Villers Les Nancy, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mel-frequency cepstral coefficients (MFCC) are the most widely used features in current speech recognition systems. However, they have a poor physical interpretation and they do not lie in the frequency domain. Frequency filtering (FF) is a technique that has been recently developed to design frequency-localized speech features that perform similar to MFCC in terms of recognition performances. Motivated by our desire to build time-frequency speech models, we wanted to use the FF technique as front-end. However, when evaluating FF on the Aurora-3 database we found some discrepancies in the highly mismatch case. This led us to put FF in another perspective: the wavelet transform. By doing so, we were able to explain the discrepancies and to achieve significant improvements in recognition in the highly mismatch case.
引用
收藏
页码:452 / 460
页数:9
相关论文
共 50 条
  • [1] Perceptual wavelet filtering for robust speech recognition
    Van Pham, Tuan
    Stark, Michael
    Kubin, Gernot
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4385 - 4388
  • [2] Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition
    Gomez, Randy
    Kawahara, Tatsuya
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1242 - 1245
  • [3] Spectrum filtering with FRM for robust speech recognition
    Hayasaka, Noboru
    Miyanaga, Yoshikazu
    [J]. 2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 3285 - +
  • [4] Matched filtering approach to robust speech recognition
    Avadhanulu, J.V.
    Sreenivas, T.V.
    [J]. Journal of the Indian Institute of Science, 79 (03): : 185 - 196
  • [5] Time and frequency filtering of filter-bank energies for robust HMM speech recognition
    Nadeu, C
    Macho, D
    Hernando, J
    [J]. SPEECH COMMUNICATION, 2001, 34 (1-2) : 93 - 114
  • [6] Robust speech recognition features based on temporal trajectory filtering of frequency band spectrum
    Shen, JL
    Hwang, WL
    Lee, LS
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 881 - 884
  • [7] Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition
    Adiga, Aniruddha
    Magimai-Doss, Mathew
    Seelamantula, Chandra Sekhar
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
  • [8] Robust speech recognition using wavelet coefficient features
    Gupta, M
    Gilbert, A
    [J]. ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 445 - 448
  • [9] Denoising Using Optimized Wavelet Filtering for Automatic Speech Recognition
    Gomez, Randy
    Kawahara, Tatsuya
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1684 - 1687
  • [10] Speech feature extraction based on wavelet modulation scale for robust speech recognition
    Ma, Xin
    Zhou, Weidong
    Ju, Fang
    Jiang, Qi
    [J]. NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 499 - 505