Signal bias removal by maximum likelihood estimation for robust telephone speech recognition

被引：0

作者：

Rahim, MG

Juang, BH

机构：

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1996年 / 4卷 / 01期

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

An acoustical mismatch between the training and the testing conditions of hidden Markov model (HMM)-based speech recognition systems often causes a severe degradation in the recognition performance. In telephone speech recognition, for example, undesirable signal components due to ambient noise and channel distortion, as well as due to different variations of telephone handsets render the recognizer unusable for real-world applications, This paper presents a signal bias removal (SBR) method based on maximum likelihood estimation for the minimization of these undesirable effects, The proposed method is readily applicable in various architectures, i.e., discrete (vector-quantization based), semicontinuous and continuous density HMM, In this paper, the SBR method, integrated into a discrete density HMM, is applied to telephone speech recognition where the contamination due to extraneous signal components is assumed to be unknown, To enable real-time implementation, a sequential method for the estimation of the bias is presented, Experimental results for speaker-independent connected digit recognition show a reduction in the per digit error rate by up to 41% and 14% during mismatched and matched training and testing conditions, respectively.

引用

页码：19 / 30

页数：12

共 50 条

[11] Maximum-likelihood approach to stochastic matching for robust speech recognition
Sankar, A
Lee, CH
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (03): : 190 - 202
[12] Robust Maximum Likelihood Estimation
Bertsimas, Dimitris
Nohadani, Omid
[J]. INFORMS JOURNAL ON COMPUTING, 2019, 31 (03) : 445 - 458
[13] Signal bias removal based GMM for robust speaker recognition
Kim, YJ
Chung, JH
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4163 - 4163
[14] Robust speech recognition in telephone network
Han, MS
Park, GB
Park, JG
Han, JQ
[J]. PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 1103 - 1106
[15] Signal adaptive spectral envelope estimation for robust speech recognition
Woelfel, Matthias
[J]. SPEECH COMMUNICATION, 2009, 51 (06) : 551 - 561
[16] Signal bias removal with orthogonal transform for adverse Mandarin speech recognition
Wang, WJ
Chen, SH
[J]. ELECTRONICS LETTERS, 2000, 36 (09) : 851 - 852
[17] Maximum Likelihood Clustering of Gaussians for Speech Recognition
Kannan, A.
Ostendorf, M.
Rohlicek, J. R.
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03): : 453 - 455
[18] Maximum Likelihood Model Adaptation Using Piecewise Linear Transformation for Robust Speech Recognition
Lue, Yong
Wu, Zhenyang
[J]. ISCE: 2009 IEEE 13TH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS, VOLS 1 AND 2, 2009, : 479 - 481
[19] BIAS ERROR IN MAXIMUM-LIKELIHOOD-ESTIMATION
KOCH, SP
[J]. JOURNAL OF HYDROLOGY, 1991, 122 (1-4) : 289 - 300
[20] Channel compensation for robust telephone speech recognition
Han, JQ
Han, MS
Gao, W
[J]. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 169 - 172

← 1 2 3 4 5 →