On the use of evolutionary algorithms to improve the robustness of continuous speech recognition systems in adverse conditions

被引:1
|
作者
Selouani, SA
O'Shaughnessy, D
机构
[1] Univ Moncton, Secteur Gest Informat, Shippegan, NB E8S 1P6, Canada
[2] Univ Quebec, INRS Energie Mat Telecommun, Montreal, PQ H5A 1K6, Canada
关键词
speech recognition; genetic algorithms; Karhunen-Loeve transform; hidden Markov models; robustness;
D O I
10.1155/S1110865703302070
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Limiting the decrease in performance due to acoustic environment changes remains a major challenge for continuous speech recognition (CSR) systems. We propose a novel approach which combines the Karhunen-Loeve transform (KLT) in the mel-frequency domain with a genetic algorithm (GA) to enhance the data representing corrupted speech. The idea consists of projecting noisy speech parameters onto the space generated by the genetically optimized principal axis issued from the KLT. The enhanced parameters increase the recognition rate for highly interfering noise environments. The proposed hybrid technique, when included in the front-end of an HTK-based CSR system, outperforms that of the conventional recognition process in severe interfering car noise environments for a wide range of signal-to-noise ratios (SNRs) varying from 16 dB to -4 dB. We also showed the effectiveness of the KLT-GA method in recognizing speech subject to telephone channel degradations.
引用
收藏
页码:814 / 823
页数:10
相关论文
共 50 条
  • [1] On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions
    Sid-Ahmed Selouani
    Douglas O'Shaughnessy
    EURASIP Journal on Advances in Signal Processing, 2003
  • [2] On the use of evolutionary algorithms to improve the robustness of continuous speech recognition systems in adverse conditions
    Selouani, S.-A. (selouani@umcs.ca), 1600, Hindawi Publishing Corporation (2003):
  • [3] Towards improving speech detection robustness for speech recognition in adverse conditions
    Karray, L
    Martin, A
    SPEECH COMMUNICATION, 2003, 40 (03) : 261 - 276
  • [4] Excluding fitness helps improve robustness of evolutionary algorithms
    Sprogar, M
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 4, PROCEEDINGS, 2005, 3684 : 8 - 14
  • [5] Speech recognition in adverse conditions: A review
    Mattys, Sven L.
    Davis, Matthew H.
    Bradlow, Ann R.
    Scott, Sophie K.
    LANGUAGE AND COGNITIVE PROCESSES, 2012, 27 (7-8): : 953 - 978
  • [6] Adding Noise to Improve Noise Robustness in Speech Recognition
    Morales, Nicolas
    Gu, Liang
    Gao, Yuqing
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 861 - +
  • [7] Exploitation of spectral variance to improve robustness in speech recognition
    Xu, H
    Tan, ZH
    Dalsgaard, P
    Lindberg, B
    ELECTRONICS LETTERS, 2006, 42 (05) : 312 - 314
  • [8] PARALLEL ALGORITHMS FOR SYLLABLE RECOGNITION IN CONTINUOUS SPEECH
    DEMORI, R
    LAFACE, P
    MONG, Y
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1985, 7 (01) : 56 - 69
  • [9] Fast search algorithms for continuous speech recognition
    Zhao, J
    Hamaker, J
    Deshmukh, N
    Ganapathiraju, A
    Picone, J
    IEEE SOUTHEASTCON '99, PROCEEDINGS, 1999, : 36 - 39
  • [10] SPEECH ENHANCEMENT AND FEATURES COMPENSATION ALGORITHMS FOR CONTINUOUS SPEECH RECOGNITION
    Arcos, Christian
    Grivet, Marco
    Alcaim, Abraham
    2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 27 - 31