Combining feature compensation and Weighted Viterbi Decoding for noise robust speech recognition with limited adaptation data

被引：0

作者：

Cui, XD ^{[1
]}

Alwan, A ^{[1
]}

机构：

[1] Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90095 USA

来源：

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING | 2004年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Acoutic models trained with clean speech signals suffer in the presence of background noise. In some situations, only a limited amount of noisy data of the new environment is available based on which the clean models could be adapted. A feature compensation approach employing polynomial regression of the signal-to-noise ratio (SNR) is proposed in this paper. While clean acoustic models remain unchanged, a bias which is a polynomial function of utterance SNR is estimated and removed from the noisy feature. Depending on the amount of noisy data available, the algorithm could be flexibly carried out at different levels of granularity. Based on the Euclidean distance, the similarity between the residual distribution and the clean models are estimated and used as the confidence factor in a back-end Weighted Viterbi Decoding (WVD) algorithm. With limited amounts of noisy data, the feature compensation algorithm outperforms Maximum Likelihood Linear Regression (MLLR) for the Aurora2 database. Weighted Viterbi decoding further improves recognition accuracy.

引用

页码：969 / 972

页数：4

共 50 条

[21] ROBUST FEATURE SPACE ADAPTATION FOR TELEPHONY SPEECH RECOGNITION
Lei, Xin
Hamaker, Jon
He, Xiaodong
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 773 - +
[22] Combining speech enhancement and auditory feature extraction for robust speech recognition
Kleinschmidt, M
Tchorz, J
Kollmeier, B
SPEECH COMMUNICATION, 2001, 34 (1-2) : 75 - 91
[23] Issues with uncertainty decoding for noise robust automatic speech recognition
Liao, H.
Gales, M. J. F.
SPEECH COMMUNICATION, 2008, 50 (04) : 265 - 277
[24] Online feature compensation using modified quantile based noise estimation for robust speech recognition
Lee, Heungkyu
Kwon, Ohil
Kim, June
ADVANCES IN INTELLIGENT IT: ACTIVE MEDIA TECHNOLOGY 2006, 2006, 138 : 236 - 242
[25] ROBUST SPEECH RECOGNITION USING DYNAMIC NOISE ADAPTATION
Rennie, Steven
Dognin, Pierre
Fousek, Petr
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4592 - 4595
[26] AN INTEGRATED APPROACH TO FEATURE COMPENSATION COMBINING PARTICLE FILTERS AND HIDDEN MARKOV MODELS FOR ROBUST SPEECH RECOGNITION
Mushtaq, Aleem
Hui-Lee, Chin
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4757 - 4760
[27] Model-based feature compensation for robust speech recognition
Shen, Haifeng
Li, Qunxia
Guo, Jun
Liu, Gang
FUNDAMENTA INFORMATICAE, 2006, 72 (04) : 529 - 539
[28] Model-based feature compensation for robust speech recognition
School of Information Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China
不详
不详
Fundam Inf, 2006, 4 (529-539):
[29] A Particle Filter Feature Compensation Approach to Robust Speech Recognition
Mushtaq, Aleem
Tsao, Yu
Hui-Lee, Chin
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2054 - +
[30] On stochastic feature and model compensation approaches to robust speech recognition
Lee, CH
SPEECH COMMUNICATION, 1998, 25 (1-3) : 29 - 47

← 1 2 3 4 5 →