RAPID JOINT SPEAKER AND NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION

被引:0
|
作者
Chin, K. K. [1 ]
Xu, Haitian [1 ]
Gales, Mark J. F. [1 ]
Breslin, Catherine [1 ]
Knill, Kate [1 ]
机构
[1] Toshiba Res Europe Ltd, Cambridge Res Lab, Cambridge, England
关键词
Speaker adaptation; Noise compensation; Robust ASR; Rapid adaptation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.
引用
收藏
页码:5500 / 5503
页数:4
相关论文
共 50 条
  • [21] A Noise Robust Speech Recognition Method Using Model Compensation Based on Speech Enhancement
    Shen, Guanghu
    Jung, Ho-Youl
    Chung, Hyun-Yeol
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (04): : 191 - 199
  • [22] An integrated study of speaker normalisation and HMM adaptation for noise robust speaker-independent speech recognition
    Hariharan, R
    Viikki, O
    SPEECH COMMUNICATION, 2002, 37 (3-4) : 349 - 361
  • [23] Model-space compensation of microphone and noise for speaker-independent speech recognition
    Gong, YF
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 660 - 663
  • [24] A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition
    Gong, YF
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 975 - 983
  • [25] MULTILEVEL SPEECH INTELLIGIBILITY FOR ROBUST SPEAKER RECOGNITION
    Nemala, Sridhar Krishna
    Elhilali, Mounya
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4393 - 4396
  • [26] Maximum likelihood joint estimation of channel and noise for robust speech recognition
    Zhao, YX
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1109 - 1112
  • [27] Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition
    Xu, Haitian
    Gales, Mark J. F.
    Chin, K. K.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1665 - 1676
  • [28] Adaptive compensation for robust speech recognition
    Lee, CH
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 357 - 364
  • [29] Rapid speaker adaptation for continuous speech recognition
    Lu, Ping
    Wu, Ji
    Wang, Zuoying
    Lu, Dajin
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2002, 42 (07): : 977 - 980
  • [30] A comparative study of noise estimation algorithms for nonlinear compensation in robust speech recognition
    Zhao, Yong
    Juang, Biing-Hwang
    SPEECH COMMUNICATION, 2017, 89 : 58 - 69