The regularized SNN-TA model for recognition of noisy speech

Cited by: 2

Authors:

Trentin, E [1]

Matassoni, M [1]

Affiliations:

[1] ITC Irst, Trent, Italy

Source:

IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL V | 2000

Keywords:

DOI:

10.1109/IJCNN.2000.861441

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The Segmental Neural Network (SNN) architecture was introduced at BBN by Zavaliagkos et al. for rescoring the N-best hypotheses yielded by a standard continuous density hidden Markov model (CDHMM) applied to automatic speech recognition. An enhanced connectionist model, called SNN with trainable amplitude of activation functions (SNN-TA), is first used in this paper instead of the CDHMM to perform the recognition of isolated words. Viterbi-based segmentation, relying on the level building algorithm, is then introduced; it can be combined with the SNN-TA to obtain a hybrid framework for continuous speech recognition. The present paradigm is applied to the recognition of isolated digits, collected in a real car environment under several noisy conditions (traffic, speed, road conditions, etc.) using a microphone placed far from the talker. We stress the fact that robustness to noise can be increased by improving the generalization capabilities of the speech recognizer. In this perspective, while CDHMMs completely lack a proper regularization theory, a regularized SNN-TA model is discussed, which yields effective generalization and noise tolerance, outperforming the CDHMM on the noisy task under consideration.

Pages: 97-102

Page count: 6
