RAPID JOINT SPEAKER AND NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION

被引:0
|
作者
Chin, K. K. [1 ]
Xu, Haitian [1 ]
Gales, Mark J. F. [1 ]
Breslin, Catherine [1 ]
Knill, Kate [1 ]
机构
[1] Toshiba Res Europe Ltd, Cambridge Res Lab, Cambridge, England
关键词
Speaker adaptation; Noise compensation; Robust ASR; Rapid adaptation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.
引用
收藏
页码:5500 / 5503
页数:4
相关论文
共 50 条
  • [41] Noise compensation for speech recognition with arbitrary additive noise
    Ming, J
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 833 - 844
  • [42] Joint Tracking of Clean Speech and Noise Using HMMs and Particle Filters for Robust Speech Recognition
    Mushtaq, Aleem
    Lee, Chin-Hui
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1618 - 1622
  • [43] EXPLOITING LONG-RANGE TEMPORAL DYNAMICS OF SPEECH FOR NOISE-ROBUST SPEAKER RECOGNITION
    Jafari, Ayeh
    Srinivasan, Ramji
    Crookes, Danny
    Ming, Ji
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2123 - 2127
  • [44] Speech Feature Compensation Based on Pseudo Stereo Codebooks for Robust Speech Recognition in Additive Noise Environments
    Hsieh, Tsung-hsueh
    Hung, Jeih-weih
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2400 - 2403
  • [45] JOINT UNCERTAINTY DECODING WITH THE SECOND ORDER APPROXIMATION FOR NOISE ROBUST SPEECH RECOGNITION
    Xu, Haitian
    Chin, K. K.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3841 - 3844
  • [46] Mixtures of Bayesian Joint Factor Analyzers for Noise Robust Automatic Speech Recognition
    Cui, Xiaodong
    Goel, Vaibhava
    Kingsbury, Brian
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3011 - 3015
  • [47] Comparison of Estimation Techniques in Joint Uncertainty Decoding for Noise Robust Speech Recognition
    Xu, Haitian
    Chin, K. K.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2363 - 2366
  • [48] Channel and speaker adaptation techniques for robust speech recognition
    Chen, Jingdong
    Yao, Lei
    Huang, Taiyi
    Shengxue Xuebao/Acta Acustica, 1998, 23 (06): : 537 - 544
  • [49] Robust Digital Speech Watermarking For Online Speaker Recognition
    Nematollahi, Mohammad Ali
    Gamboa-Rosales, Hamurabi
    Akhaee, Mohammad Ali
    Al-Haddad, S. A. R.
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [50] Channel Robust MFCCs for Continuous Speech Speaker Recognition
    Chougule, Sharada Vikram
    Chavan, Mahesh S.
    ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 557 - 568