Signal bias removal by maximum likelihood estimation for robust telephone speech recognition

被引:0
|
作者
Rahim, MG
Juang, BH
机构
来源
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An acoustical mismatch between the training and the testing conditions of hidden Markov model (HMM)-based speech recognition systems often causes a severe degradation in the recognition performance. In telephone speech recognition, for example, undesirable signal components due to ambient noise and channel distortion, as well as due to different variations of telephone handsets render the recognizer unusable for real-world applications, This paper presents a signal bias removal (SBR) method based on maximum likelihood estimation for the minimization of these undesirable effects, The proposed method is readily applicable in various architectures, i.e., discrete (vector-quantization based), semicontinuous and continuous density HMM, In this paper, the SBR method, integrated into a discrete density HMM, is applied to telephone speech recognition where the contamination due to extraneous signal components is assumed to be unknown, To enable real-time implementation, a sequential method for the estimation of the bias is presented, Experimental results for speaker-independent connected digit recognition show a reduction in the per digit error rate by up to 41% and 14% during mismatched and matched training and testing conditions, respectively.
引用
收藏
页码:19 / 30
页数:12
相关论文
共 50 条
  • [11] Maximum-likelihood approach to stochastic matching for robust speech recognition
    Sankar, A
    Lee, CH
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (03): : 190 - 202
  • [12] Robust Maximum Likelihood Estimation
    Bertsimas, Dimitris
    Nohadani, Omid
    [J]. INFORMS JOURNAL ON COMPUTING, 2019, 31 (03) : 445 - 458
  • [13] Signal bias removal based GMM for robust speaker recognition
    Kim, YJ
    Chung, JH
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4163 - 4163
  • [14] Robust speech recognition in telephone network
    Han, MS
    Park, GB
    Park, JG
    Han, JQ
    [J]. PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 1103 - 1106
  • [15] Signal adaptive spectral envelope estimation for robust speech recognition
    Woelfel, Matthias
    [J]. SPEECH COMMUNICATION, 2009, 51 (06) : 551 - 561
  • [16] Signal bias removal with orthogonal transform for adverse Mandarin speech recognition
    Wang, WJ
    Chen, SH
    [J]. ELECTRONICS LETTERS, 2000, 36 (09) : 851 - 852
  • [17] Maximum Likelihood Clustering of Gaussians for Speech Recognition
    Kannan, A.
    Ostendorf, M.
    Rohlicek, J. R.
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03): : 453 - 455
  • [18] Maximum Likelihood Model Adaptation Using Piecewise Linear Transformation for Robust Speech Recognition
    Lue, Yong
    Wu, Zhenyang
    [J]. ISCE: 2009 IEEE 13TH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS, VOLS 1 AND 2, 2009, : 479 - 481
  • [19] BIAS ERROR IN MAXIMUM-LIKELIHOOD-ESTIMATION
    KOCH, SP
    [J]. JOURNAL OF HYDROLOGY, 1991, 122 (1-4) : 289 - 300
  • [20] Channel compensation for robust telephone speech recognition
    Han, JQ
    Han, MS
    Gao, W
    [J]. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 169 - 172