Signal bias removal by maximum likelihood estimation for robust telephone speech recognition

被引:0
|
作者
Rahim, MG
Juang, BH
机构
来源
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An acoustical mismatch between the training and the testing conditions of hidden Markov model (HMM)-based speech recognition systems often causes a severe degradation in the recognition performance. In telephone speech recognition, for example, undesirable signal components due to ambient noise and channel distortion, as well as due to different variations of telephone handsets render the recognizer unusable for real-world applications, This paper presents a signal bias removal (SBR) method based on maximum likelihood estimation for the minimization of these undesirable effects, The proposed method is readily applicable in various architectures, i.e., discrete (vector-quantization based), semicontinuous and continuous density HMM, In this paper, the SBR method, integrated into a discrete density HMM, is applied to telephone speech recognition where the contamination due to extraneous signal components is assumed to be unknown, To enable real-time implementation, a sequential method for the estimation of the bias is presented, Experimental results for speaker-independent connected digit recognition show a reduction in the per digit error rate by up to 41% and 14% during mismatched and matched training and testing conditions, respectively.
引用
收藏
页码:19 / 30
页数:12
相关论文
共 50 条
  • [1] A Variational Approach to Robust Maximum Likelihood Estimation for Speech Recognition
    Omar, Mohamed Kamal
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1049 - 1052
  • [2] Maximum likelihood joint estimation of channel and noise for robust speech recognition
    Zhao, YX
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1109 - 1112
  • [3] Estimation of channel bias for telephone speech recognition
    Chien, JT
    Wang, HC
    Lee, LM
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1840 - 1843
  • [4] Maximum likelihood polynomial regression for robust speech recognition
    L Yong WU Zhenyang (School of Information Science and Engineering
    [J]. Chinese Journal of Acoustics, 2011, 30 (03) : 358 - 370
  • [5] Maximum likelihood subband polynomial regression for robust speech recognition
    Lu, Yong
    Wu, Zhenyang
    [J]. APPLIED ACOUSTICS, 2013, 74 (05) : 640 - 646
  • [6] Integrated bias removal techniques for robust speech recognition
    Lawrence, C
    Rahim, M
    [J]. COMPUTER SPEECH AND LANGUAGE, 1999, 13 (03): : 283 - 298
  • [7] Convolutional Maximum-Likelihood Distortionless Response Beamforming With Steering Vector Estimation for Robust Speech Recognition
    Cho, Byung Joon
    Park, Hyung-Min
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1352 - 1367
  • [8] MAXIMUM LIKELIHOOD ADAPTATION OF HISTOGRAM EQUALIZATION WITH CONSTRAINT FOR ROBUST SPEECH RECOGNITION
    Xiao, Xiong
    Li, Jinyu
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5480 - 5483
  • [9] Maximum likelihood sub-band adaptation for robust speech recognition
    Zhu, DL
    Nakamura, S
    Paliwal, KK
    Wang, RH
    [J]. SPEECH COMMUNICATION, 2005, 47 (03) : 243 - 264
  • [10] A combination of discriminative and Maximum Likelihood techniques for noise robust speech recognition
    Laurila, K
    Vasilache, M
    Viikki, O
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 85 - 88