Robust speech recognition using probabilistic union models

被引：27

作者：

Ming, J ^{[1
]}

Jancovic, P ^{[1
]}

Smith, FJ ^{[1
]}

机构：

[1] Queens Univ Belfast, Dept Comp Sci, Belfast BT7 1NN, Antrim, North Ireland

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2002年 / 10卷 / 06期

基金：

英国工程与自然科学研究理事会;

关键词：

acoustic modeling; noise robustness; probabilistic union models; speech recognition;

D O I：

10.1109/TSA.2002.803439

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper introduces a new statistical approach, namely the probabilistic union model, for speech recognition involving partial, unknown frequency-band corruption. Partial frequency-band corruption accounts for the effect of a family of real-world noises. Previous methods based on the missing feature theory usually require the identity of the noisy bands. This identification can be difficult for unexpected noise with unknown, time-varying band characteristics. The new model combines the local frequency-band information based on the union of random events, to reduce the dependence of the model on information about the noise. This model partially accomplishes the target: offering robustness to partial frequency-band corruption, while requiring no information about the noise. This paper introduces the theory and implementation of the union model, and is focused on several important advances. These new developments include a new algorithm for automatic order selection, a generalization of the modeling principle to accommodate partial feature stream corruption, and a combination of the union model with conventional noise reduction techniques to deal with a mixture of stationary noise and unknown, nonstationary noise. For the evaluation, we used the TIDIGITS database for speaker-independent connected digit recognition. The utterances were corrupted by various types of additive noise, stationary or time-varying, assuming no knowledge about the noise characteristics. The results indicate that the new model offers significantly improved robustness in comparison to other models.

引用

页码：403 / 414

页数：12

共 50 条

[1] Speech recognition using probabilistic and statistical models
Singh, Amber
Anand, R. S.
[J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 686 - 690
[2] A probabilistic union model for sub-band based robust speech recognition
Ming, J
Smith, FJ
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1787 - 1790
[3] ROBUST SPEECH RECOGNITION USING MULTIVARIATE COPULA MODELS
Bayestehtashk, Alireza
Shafran, Izhak
Babaeian, Amir
[J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5890 - 5894
[4] ROBUST SPEECH RECOGNITION USING MULTIPLE PRIOR MODELS FOR SPEECH RECONSTRUCTION
Narayanan, Arun
Zhao, Xiaojia
Wang, DeLiang
Fosler-Lussier, Eric
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4800 - 4803
[5] Speaker adaptation techniques for speech recognition using probabilistic models
Shinoda, K
[J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2005, 88 (12): : 25 - 42
[6] A posterior union model for robust speech recognition
Ming, J
Lin, J
[J]. ELECTRONICS LETTERS, 2003, 39 (01) : 162 - 163
[7] Probabilistic class histogram equalization for robust speech recognition
Suh, Youngjoo
Ji, Mikyong
Kim, Hoirin
[J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (04) : 287 - 290
[8] Towards robust and adaptive speech recognition models
Bourlard, H
Bengio, S
Weber, K
[J]. MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 169 - 189
[9] A posterior union model with applications to robust speech and speaker recognition
Ming, Ji
Lin, Jie
Smith, F. Jack
[J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
[10] A Posterior Union Model with Applications to Robust Speech and Speaker Recognition
Ji Ming
Jie Lin
F. Jack Smith
[J]. EURASIP Journal on Advances in Signal Processing, 2006

← 1 2 3 4 5 →