The regularized SNN-TA model for recognition of noisy speech

Cited by: 2
Authors
Trentin, E [1 ]
Matassoni, M [1 ]
Affiliations
[1] ITC Irst, Trent, Italy
Keywords
DOI
10.1109/IJCNN.2000.861441
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The Segmental Neural Network (SNN) architecture was introduced at BBN by Zavaliagkos et al. for rescoring the N-best hypotheses yielded by a standard continuous density hidden Markov model (CDHMM) applied to automatic speech recognition. In this paper, an enhanced connectionist model, called SNN with trainable amplitude of activation functions (SNN-TA), is first used instead of the CDHMM to perform the recognition of isolated words. A Viterbi-based segmentation, relying on the level building algorithm, is then introduced; it can be combined with the SNN-TA to obtain a hybrid framework for continuous speech recognition. The present paradigm is applied to the recognition of isolated digits, collected in a real car environment under several noisy conditions (traffic, speed, road conditions, etc.) using a microphone placed far from the talker. We stress the fact that robustness to noise can be increased by improving the generalization capabilities of the speech recognizer. In this perspective, while CDHMMs completely lack a proper regularization theory, a regularized SNN-TA model is discussed, which yields effective generalization and noise tolerance, outperforming the CDHMM on the noisy task under consideration.
Pages: 97 - 102
Page count: 6
Related papers
50 in total
  • [21] Problems and solutions for noisy speech recognition
    Haton, J.-P.
    Journal De Physique, 1994, 4 (5 pt 1) : 439 - 448
  • [22] Auditory model for robust speech recognition in real world noisy environments
    Kim, DS
    Lee, SY
    Kil, RM
    Zhu, XL
    ELECTRONICS LETTERS, 1997, 33 (01) : 12 - 13
  • [23] A probabilistic union model with automatic order selection for noisy speech recognition
    Jancovic, P
    Ming, J
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (03) : 1641 - 1648
  • [24] A probabilistic union model with automatic order selection for noisy speech recognition
    Jančovič, P.
    Ming, J.
    1641, Acoustical Society of America (110):
  • [25] PROBLEMS AND SOLUTIONS FOR NOISY SPEECH RECOGNITION
    HATON, JP
    JOURNAL DE PHYSIQUE IV, 1994, 4 (C5) : 439 - 448
  • [26] A new noisy speech recognition method
    Zhao, XQ
    Wang, J
    International Symposium on Communications and Information Technologies 2005, Vols 1 and 2, Proceedings, 2005 : 282 - 286
  • [27] Speech emotion recognition in noisy environment
    Chenchah, Farah
    Lachiri, Zied
    2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016 : 788 - 792
  • [28] SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY
    GONG, YF
    SPEECH COMMUNICATION, 1995, 16 (03) : 261 - 291
  • [29] Study of speech recognition in noisy environment
    Kreisinger, T
    Pollak, P
    Sovka, P
    Uhlir, J
    SIGNAL ANALYSIS & PREDICTION I, 1997 : 334 - 337
  • [30] Robust recognition of noisy speech using speech enhancement
    Xu, YF
    Zhang, JJ
    Yao, KS
    Cao, ZG
    Ma, ZX
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000 : 734 - 737