RAPID JOINT SPEAKER AND NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION

被引：0

作者：

Chin, K. K. ^{[1
]}

Xu, Haitian ^{[1
]}

Gales, Mark J. F. ^{[1
]}

Breslin, Catherine ^{[1
]}

Knill, Kate ^{[1
]}

机构：

[1] Toshiba Res Europe Ltd, Cambridge Res Lab, Cambridge, England

来源：

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年

关键词：

Speaker adaptation; Noise compensation; Robust ASR; Rapid adaptation;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.

引用

页码：5500 / 5503

页数：4

共 50 条

[21] A Noise Robust Speech Recognition Method Using Model Compensation Based on Speech Enhancement
Shen, Guanghu
Jung, Ho-Youl
Chung, Hyun-Yeol
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (04): : 191 - 199
[22] An integrated study of speaker normalisation and HMM adaptation for noise robust speaker-independent speech recognition
Hariharan, R
Viikki, O
SPEECH COMMUNICATION, 2002, 37 (3-4) : 349 - 361
[23] Model-space compensation of microphone and noise for speaker-independent speech recognition
Gong, YF
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 660 - 663
[24] A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition
Gong, YF
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 975 - 983
[25] MULTILEVEL SPEECH INTELLIGIBILITY FOR ROBUST SPEAKER RECOGNITION
Nemala, Sridhar Krishna
Elhilali, Mounya
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4393 - 4396
[26] Maximum likelihood joint estimation of channel and noise for robust speech recognition
Zhao, YX
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1109 - 1112
[27] Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition
Xu, Haitian
Gales, Mark J. F.
Chin, K. K.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1665 - 1676
[28] Adaptive compensation for robust speech recognition
Lee, CH
1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 357 - 364
[29] Rapid speaker adaptation for continuous speech recognition
Lu, Ping
Wu, Ji
Wang, Zuoying
Lu, Dajin
Qinghua Daxue Xuebao/Journal of Tsinghua University, 2002, 42 (07): : 977 - 980
[30] A comparative study of noise estimation algorithms for nonlinear compensation in robust speech recognition
Zhao, Yong
Juang, Biing-Hwang
SPEECH COMMUNICATION, 2017, 89 : 58 - 69

← 1 2 3 4 5 →