The regularized SNN-TA model for recognition of noisy speech

被引:2
|
作者
Trentin, E [1 ]
Matassoni, M [1 ]
机构
[1] ITC Irst, Trent, Italy
关键词
D O I
10.1109/IJCNN.2000.861441
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Segmental Neural Network (SNN) architecture was introduced at BBN by Zavaliagkos et al. for rescoring the N-best hypothesis yielded by a standard Continuous Density hidden Markov model (CDHMM) applied to Automatic Speech Recognition. An enhanced connectionist model, called SNN with trainable amplitude of activation functions (SNN-TA) is first used in this paper instead of the CDHMM to perform the recognition of isolated words. Viterbi-based segmentation is then introduced, relying on the level building algorithm, that can be combined with the SNN-TA to obtain a hybrid framework for continuous speech recognition. The present paradigm is applied to the recognition of isolated digits, collected in a real car environment under several noisy conditions (traffic, speed, road conditions, etc.) using a microphone placed far from the talker. We stress the fact that robustness to noise can be increased by improving the generalization capabilities of the speech recognizer. In this perspective, while CDHMMs completely lack of a proper regularization theory, a regularized SNN-TA model is discussed, which yields effective generalization and noise-tolerance, outperforming the CDHMM on the noisy task under consideration.
引用
收藏
页码:97 / 102
页数:6
相关论文
共 50 条
  • [1] Noise-tolerant speech recognition: the SNN-TA approach
    Trentin, E
    Matassoni, M
    INFORMATION SCIENCES, 2003, 156 (1-2) : 55 - 69
  • [2] SPEECH RECOGNITION WITH NO SPEECH OR WITH NOISY SPEECH
    Krishna, Gautam
    Co Tran
    Yu, Jianguo
    Tewfik, Ahmed H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1090 - 1094
  • [4] Multi-model approach for noisy speech recognition
    Guan, CT
    Leung, SH
    Lau, WH
    ELECTRONICS LETTERS, 1998, 34 (01) : 30 - 32
  • [5] Advancing Speech Recognition With No Speech Or With Noisy Speech
    Krishna, Gautam
    Tran, Co
    Carnahan, Mason
    Tewfik, Ahmed
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [6] A Study on Noisy Speech Recognition
    Saeed, Khalid
    Szczepanski, Adam
    ICBAKE: 2009 INTERNATIONAL CONFERENCE ON BIOMETRICS AND KANSEI ENGINEERING, 2009, : 142 - 147
  • [7] A fast algorithm for parallel model combination for noisy speech recognition
    Hwang, TH
    Wang, HC
    COMPUTER SPEECH AND LANGUAGE, 2000, 14 (02): : 81 - 100
  • [8] HCRF-based Model Compensation for Noisy Speech Recognition
    Hong, Wei-Tyng
    2013 IEEE 17TH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE), 2013, : 277 - 278
  • [9] An Improved Parallel Model Combination Method for Noisy Speech Recognition
    Veisi, Hadi
    Sameti, Hossein
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 237 - 242
  • [10] An improved noisy channel model for speech recognition error correction
    Li, Baoxiang
    Liu, Gang
    Guo, Jun
    Lu, Yueming
    International Journal of Advancements in Computing Technology, 2012, 4 (12) : 110 - 118