Discriminative language model adaptation for Mandarin broadcast speech transcription and translation

被引：0

作者：

Liu, X. A. ^{[1
]}

Byrne, W. J. ^{[1
]}

Gales, M. J. F. ^{[1
]}

de Gispert, A. ^{[1
]}

Tomalin, M. ^{[1
]}

Woodland, P. C. ^{[1
]}

Yu, K. ^{[1
]}

机构：

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England

来源：

2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2 | 2007年

关键词：

speech recognition and translation; language model adaptation; discriminative training;

D O I：

10.1109/ASRU.2007.4430101

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper investigates unsupervised test-time adaptation of language models (LM) using discriminative methods for a Mandarin broadcast speech transcription and translation task. A standard approach to adapt interpolated language models to is to optimize the component weights by minimizing the perplexity on supervision data. This is a widely made approximation for language modeling in automatic speech recognition (ASR) systems. For speech translation tasks, it is unclear whether a strong correlation still exists between perplexity and various forms of error cost functions in recognition and translation stages. The proposed minimum Bayes risk (MBR) based approach provides a flexible framework for unsupervised LM adaptation. It generalizes to a variety of forms of recognition and translation error metrics. LM adaptation is performed at the audio document level using either the character error rate (CER), or translation edit rate (TER) as the cost function. An efficient parameter estimation scheme using the extended Baum-Welch (EBW) algorithm is proposed. Experimental results on a state-of-the-art speech recognition and translation system are presented. The MBR adapted language models gave the best recognition and translation performance and reduced the TER score by up to 0.54% absolute.

引用

页码：153 / 158

页数：6

共 50 条

[1] Statistical language model adaptation for Mandarin broadcast news transcription
Chen, B
Tsai, WH
Kuo, JW
[J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 313 - 316
[2] Unsupervised Language Model Adaptation for Mandarin Broadcast Conversation Transcription
Mrva, David
Woodland, Philip C.
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2210 - 2213
[3] The IBM mandarin broadcast speech transcription system
Chu, Stephen M.
Kuo, Hong-kwang
Liu, Yi Y.
Qin, Yong
Shi, Qin
Zweig, Geoffrey
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 345 - +
[4] Language model adaptation in machine translation from speech
Bulyko, Ivan
Matsoukas, Spyros
Schwartz, Richard
Nguyen, Long
Makhoul, John
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 117 - +
[5] Multifactor Adaptation for Mandarin Broadcast News and Conversation Speech Recognition
Wang, Wen
Mandal, Arindam
Lei, Xin
Stolcke, Andreas
Zheng, Jing
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2099 - 2102
[6] Improving speech transcription for Mandarin-English translation
Tomalin, M.
Gales, M. J. F.
Liu, X. A.
Sim, K. C.
Sinha, R.
Wang, L.
Woodland, P. C.
Yu, K.
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 97 - +
[7] Recurrent Neural Network Based Language Model Adaptation for Accent Mandarin Speech
Ni, Hao
Yi, Jiangyan
Wen, Zhengqi
Tao, Jianhua
[J]. PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 607 - 617
[8] Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System
Saykham, Kwanchiva
Chotimongkol, Ananlada
Wutiwiwatchai, Chai
[J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1690 - 1694
[9] RESAMPLING AUXILIARY DATA FOR LANGUAGE MODEL ADAPTATION IN MACHINE TRANSLATION FOR SPEECH
Maskey, Sameer
Sethy, Abhinav
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4817 - +
[10] Advances in mandarin broadcast speech transcription at IBM under the DARPA GALE program
Qin, Yong
Shi, Qin
Liu, Yi Y.
Aronowitz, Hagai
Chu, Stephen M.
Kuo, Hong-Kwang
Zweig, Geoffrey
[J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 410 - +

← 1 2 3 4 5 →