Frequency and wavelet filtering for robust speech recognition

被引:0
|
作者
Deviren, M [1 ]
Daoudi, K [1 ]
机构
[1] INRIA, LORIA, Speech Grp, F-54602 Villers Les Nancy, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mel-frequency cepstral coefficients (MFCC) are the most widely used features in current speech recognition systems. However, they have a poor physical interpretation and they do not lie in the frequency domain. Frequency filtering (FF) is a technique that has been recently developed to design frequency-localized speech features that perform similar to MFCC in terms of recognition performances. Motivated by our desire to build time-frequency speech models, we wanted to use the FF technique as front-end. However, when evaluating FF on the Aurora-3 database we found some discrepancies in the highly mismatch case. This led us to put FF in another perspective: the wavelet transform. By doing so, we were able to explain the discrepancies and to achieve significant improvements in recognition in the highly mismatch case.
引用
收藏
页码:452 / 460
页数:9
相关论文
共 50 条
  • [11] Adaptive ARMA filtering and energy normalization for robust speech recognition
    Golshan, F.
    Ahadi, S. M.
    Shariati, S. S.
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 1059 - 1062
  • [12] Robust speech, recognition using adaptively denoised wavelet coefficients
    Akyol, E
    Erzin, E
    Tekalp, AM
    [J]. PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 407 - 409
  • [13] The perceptual wavelet feature for noise robust Vietnamese speech recognition
    Trung, Nguyen Quoc
    Nghia, Phung Trung
    [J]. 2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 2008, : 255 - +
  • [14] Filtering of filter-bank energies for robust speech recognition
    Jung, HY
    [J]. ETRI JOURNAL, 2004, 26 (03) : 273 - 276
  • [15] Denoising on adapted wavelet packets domain for robust speech recognition
    Chang, SW
    Kwon, Y
    Yang, SI
    [J]. ISIE 2001: IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS PROCEEDINGS, VOLS I-III, 2001, : 497 - 500
  • [16] Robust features for speech recognition based on admissible wavelet packets
    Farooq, O
    Datta, S
    [J]. ELECTRONICS LETTERS, 2001, 37 (25) : 1554 - 1556
  • [17] Denoising on Wavelet Compromise Threshold Algorithm for Robust Speech Recognition
    Liu Xuefei
    Hu Chunhai
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION & INSTRUMENTATION, VOL. 3, 2008, : 1718 - 1721
  • [18] Robust Speech Enhancement Using Dabauchies Wavelet Based Adaptive Wavelet Thresholding for the Development of Robust Automatic Speech Recognition: A Comprehensive Review
    Shanthamallappa, Mahadevaswamy
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2024, 137 (04) : 2085 - 2119
  • [19] Instantaneous Frequency Features for Noise Robust Speech Recognition
    Nayak, Shekhar
    Dhar, Shashank B.
    Bhati, Saurabhchand
    Bramhendra, Koilakuntla
    Murty, K. Sri Rama
    [J]. 2019 25TH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2019,
  • [20] Mel Sub-Band Filtering and Compression for Robust Speech Recognition
    Nasersharif, Babak
    Akbari, Ahmad
    Homayounpour, Mohammad Mehdi
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 105 - +