On the use of evolutionary algorithms to improve the robustness of continuous speech recognition systems in adverse conditions

被引:1
|
作者
Selouani, SA
O'Shaughnessy, D
机构
[1] Univ Moncton, Secteur Gest Informat, Shippegan, NB E8S 1P6, Canada
[2] Univ Quebec, INRS Energie Mat Telecommun, Montreal, PQ H5A 1K6, Canada
关键词
speech recognition; genetic algorithms; Karhunen-Loeve transform; hidden Markov models; robustness;
D O I
10.1155/S1110865703302070
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Limiting the decrease in performance due to acoustic environment changes remains a major challenge for continuous speech recognition (CSR) systems. We propose a novel approach which combines the Karhunen-Loeve transform (KLT) in the mel-frequency domain with a genetic algorithm (GA) to enhance the data representing corrupted speech. The idea consists of projecting noisy speech parameters onto the space generated by the genetically optimized principal axis issued from the KLT. The enhanced parameters increase the recognition rate for highly interfering noise environments. The proposed hybrid technique, when included in the front-end of an HTK-based CSR system, outperforms that of the conventional recognition process in severe interfering car noise environments for a wide range of signal-to-noise ratios (SNRs) varying from 16 dB to -4 dB. We also showed the effectiveness of the KLT-GA method in recognizing speech subject to telephone channel degradations.
引用
收藏
页码:814 / 823
页数:10
相关论文
共 50 条
  • [41] Robust parsing for word lattices in continuous speech recognition systems
    Momtazi, S.
    Sameti, H.
    Fazel-Zarandi, M.
    Bahrani, M.
    2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 156 - 159
  • [42] Discriminative speaker adaptation in Persian continuous speech recognition systems
    Pirhosseinloo, Shadi
    Ganj, Farshad Almas
    4TH INTERNATIONAL CONFERENCE OF COGNITIVE SCIENCE, 2012, 32 : 296 - 301
  • [43] Optimisation of phonetic aware speech recognition through multi-objective evolutionary algorithms
    Bird, Jordan J.
    Wanner, Elizabeth
    Ekart, Aniko
    Faria, Diego R.
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 153
  • [44] A clustering algorithm for the fast match of acoustic conditions in continuous speech recognition
    Rodríguez, LJ
    Torres, MI
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2005, 3523 : 562 - 570
  • [45] Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms
    Langari, Shadi
    Marvi, Hossein
    Zahedi, Morteza
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2020, 11 (01): : 81 - 92
  • [46] Application of feature subset selection based on evolutionary algorithms for automatic emotion recognition in speech
    Alvarez, Aitor
    Cearreta, Idoia
    Lopez, Juan Miguel
    Arruti, Andoni
    Lazkano, Elena
    Sierra, Basilio
    Garay, Nestor
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2007, 4885 : 273 - 281
  • [47] Effects of Vocabulary and Implicit Linguistic Knowledge on Speech Recognition in Adverse Listening Conditions
    Fletcher, Annalise
    McAuliffe, Megan
    Kerr, Sarah
    Sinex, Donal
    AMERICAN JOURNAL OF AUDIOLOGY, 2019, 28 (03) : 742 - 755
  • [48] Investigations of Issues for Using Multiple Acoustic Models to Improve Continuous Speech Recognition
    Zhang, Rong
    Rudnicky, Alexander I.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 529 - 532
  • [49] Developing Different Test Conditions to Verify the Robustness and Versatility of Robotic Arms Controlled by Evolutionary Algorithms
    Szabo, Roland
    ELECTRONICS, 2024, 13 (11)
  • [50] Comparison of continuous speech recognition systems with unknown-word processing for speech disfluencies
    Toyohashi Univ of Technology, Toyohashi, Japan
    Syst Comput Jpn, 9 (43-53):