RAPID JOINT SPEAKER AND NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION

被引：0

作者：

Chin, K. K. ^{[1
]}

Xu, Haitian ^{[1
]}

Gales, Mark J. F. ^{[1
]}

Breslin, Catherine ^{[1
]}

Knill, Kate ^{[1
]}

机构：

[1] Toshiba Res Europe Ltd, Cambridge Res Lab, Cambridge, England

来源：

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年

关键词：

Speaker adaptation; Noise compensation; Robust ASR; Rapid adaptation;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.

引用

页码：5500 / 5503

页数：4

共 50 条

[41] Noise compensation for speech recognition with arbitrary additive noise
Ming, J
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 833 - 844
[42] Joint Tracking of Clean Speech and Noise Using HMMs and Particle Filters for Robust Speech Recognition
Mushtaq, Aleem
Lee, Chin-Hui
2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1618 - 1622
[43] EXPLOITING LONG-RANGE TEMPORAL DYNAMICS OF SPEECH FOR NOISE-ROBUST SPEAKER RECOGNITION
Jafari, Ayeh
Srinivasan, Ramji
Crookes, Danny
Ming, Ji
19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2123 - 2127
[44] Speech Feature Compensation Based on Pseudo Stereo Codebooks for Robust Speech Recognition in Additive Noise Environments
Hsieh, Tsung-hsueh
Hung, Jeih-weih
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2400 - 2403
[45] JOINT UNCERTAINTY DECODING WITH THE SECOND ORDER APPROXIMATION FOR NOISE ROBUST SPEECH RECOGNITION
Xu, Haitian
Chin, K. K.
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3841 - 3844
[46] Mixtures of Bayesian Joint Factor Analyzers for Noise Robust Automatic Speech Recognition
Cui, Xiaodong
Goel, Vaibhava
Kingsbury, Brian
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3011 - 3015
[47] Comparison of Estimation Techniques in Joint Uncertainty Decoding for Noise Robust Speech Recognition
Xu, Haitian
Chin, K. K.
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2363 - 2366
[48] Channel and speaker adaptation techniques for robust speech recognition
Chen, Jingdong
Yao, Lei
Huang, Taiyi
Shengxue Xuebao/Acta Acustica, 1998, 23 (06): : 537 - 544
[49] Robust Digital Speech Watermarking For Online Speaker Recognition
Nematollahi, Mohammad Ali
Gamboa-Rosales, Hamurabi
Akhaee, Mohammad Ali
Al-Haddad, S. A. R.
MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
[50] Channel Robust MFCCs for Continuous Speech Speaker Recognition
Chougule, Sharada Vikram
Chavan, Mahesh S.
ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 557 - 568

← 1 2 3 4 5 →