On the use of evolutionary algorithms to improve the robustness of continuous speech recognition systems in adverse conditions

Cited by: 1
Authors
Selouani, SA
O'Shaughnessy, D
Affiliations
[1] Univ Moncton, Secteur Gest Informat, Shippegan, NB E8S 1P6, Canada
[2] Univ Quebec, INRS Energie Mat Telecommun, Montreal, PQ H5A 1K6, Canada
Keywords
speech recognition; genetic algorithms; Karhunen-Loeve transform; hidden Markov models; robustness;
DOI
10.1155/S1110865703302070
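Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code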
0808 ; 0809 ;
Abstract
Limiting the decrease in performance due to acoustic environment changes remains a major challenge for continuous speech recognition (CSR) systems. We propose a novel approach which combines the Karhunen-Loeve transform (KLT) in the mel-frequency domain with a genetic algorithm (GA) to enhance the data representing corrupted speech. The idea is to project noisy speech parameters onto the space spanned by the genetically optimized principal axes obtained from the KLT. The enhanced parameters increase the recognition rate in highly interfering noise environments. The proposed hybrid technique, when included in the front-end of an HTK-based CSR system, outperforms the conventional recognition process in severe interfering car noise environments for a wide range of signal-to-noise ratios (SNRs) varying from 16 dB to -4 dB. We also show the effectiveness of the KLT-GA method in recognizing speech subject to telephone channel degradations.
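The projection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the synthetic 12-dimensional features, the fitness criterion (mean squared error against a clean reference), the population size, and the crossover and mutation operators are all assumptions chosen for illustration, and the helper names (`klt_axes`, `project`, `fitness`, `evolve_axes`) are hypothetical.

```python
import numpy as np

def klt_axes(features, k):
    """Top-k principal axes (as columns) of the feature covariance matrix."""
    cov = np.cov(features, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)        # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:k]         # indices of the k largest
    return eigvecs[:, order]

def project(features, axes):
    """Project features onto the subspace spanned by the axes.
    Assumes the features are roughly zero-mean (a simplification)."""
    q, _ = np.linalg.qr(axes)                     # orthonormal basis of the span
    return features @ q @ q.T

def fitness(axes, noisy, clean):
    """Assumed criterion: negative MSE between enhanced and clean features."""
    return -np.mean((project(noisy, axes) - clean) ** 2)

def evolve_axes(init_axes, noisy, clean, rng,
                pop_size=20, generations=30, sigma=0.05):
    """Toy GA: elitist selection, uniform crossover, Gaussian mutation."""
    population = [init_axes] + [
        init_axes + sigma * rng.standard_normal(init_axes.shape)
        for _ in range(pop_size - 1)
    ]
    for _ in range(generations):
        population.sort(key=lambda a: fitness(a, noisy, clean), reverse=True)
        parents = population[: pop_size // 2]     # best half survives unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            i, j = rng.choice(len(parents), size=2, replace=False)
            mask = rng.random(init_axes.shape) < 0.5
            child = np.where(mask, parents[i], parents[j])
            child = child + sigma * rng.standard_normal(child.shape)
            children.append(child)
        population = parents + children
    return max(population, key=lambda a: fitness(a, noisy, clean))

# Synthetic stand-in for mel-frequency features: 200 frames, 12 coefficients,
# generated from a 3-dimensional latent source plus additive noise.
rng = np.random.default_rng(0)
clean = rng.standard_normal((200, 3)) @ rng.standard_normal((3, 12))
noisy = clean + 0.5 * rng.standard_normal(clean.shape)

init_axes = klt_axes(noisy, k=3)
best_axes = evolve_axes(init_axes, noisy, clean, rng)
enhanced = project(noisy, best_axes)

print("fitness with plain KLT axes:", fitness(init_axes, noisy, clean))
print("fitness with evolved axes:  ", fitness(best_axes, noisy, clean))
```

Because the best half of each generation is carried over unchanged, the evolved axes can never score worse than the plain KLT axes under this fitness. In a real front-end the clean reference would come from training data, and the enhanced parameters would be passed on to the recognizer.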
Pages: 814-823
Page count: 10
Related papers
50 in total
  • [21] Using adaptive genetic algorithms to improve speech emotion recognition
    Sedaaghi, Mohammad H.
    Kotropoulos, Constantine
    Ververidis, Dimitrios
    2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 461 - +
  • [22] Feature extraction algorithms to improve the speech emotion recognition rate
    Anusha Koduru
    Hima Bindu Valiveti
    Anil Kumar Budati
    International Journal of Speech Technology, 2020, 23 : 45 - 55
  • [23] HMM Topology in Continuous Speech Recognition Systems
    Yared, Glauco F. G.
    Violaro, Fabio
    Selmini, Antonio Marcos
    PROCEEDINGS OF THE IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 2006, : 651 - 656
  • [24] The Combination of CMS with PMC for Improving Robustness of Speech Recognition Systems
    Veisi, Hadi
    Sameti, Hossein
    ADVANCES IN COMPUTER SCIENCE AND ENGINEERING, 2008, 6 : 825 - 829
  • [25] A Novel Approach for Auditory Spectrum Enhancement to Improve Speech Recognition's Robustness
    Salhi, Khaireddine
    Hajaiej, Zied
    Ellouze, Noureddine
    2015 IEEE 12TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2015,
  • [26] RELATIVE DIFFICULTY AND ROBUSTNESS OF SPEECH RECOGNITION TASKS THAT USE GRAMMATICAL CONSTRAINTS
    SONDHI, MM
    LEVINSON, SE
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 : S64 - S64
  • [27] A new phonetic model for continuous speech recognition systems
    Fagundes, RDR
    Corrêa, JS
    Dumouchel, P
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 572 - 575
  • [28] Large-Vocabulary Continuous Speech Recognition Systems
    Saon, George
    Chien, Jen-Tzung
    IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 18 - 33
  • [29] Boosting systems for large vocabulary continuous speech recognition
    Saon, George
    Soltau, Hagen
    SPEECH COMMUNICATION, 2012, 54 (02) : 212 - 218
  • [30] Robustness of speech recognition using genetic algorithms and a Mel-cepstral subspace approach
    Selouani, SA
    O'Shaughnessy, D
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 201 - 204