Discriminative language model adaptation for Mandarin broadcast speech transcription and translation

Cited by: 0
Authors
Liu, X. A. [1]
Byrne, W. J. [1]
Gales, M. J. F. [1]
de Gispert, A. [1]
Tomalin, M. [1]
Woodland, P. C. [1]
Yu, K. [1]
Affiliation
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
Keywords
speech recognition and translation; language model adaptation; discriminative training
DOI
10.1109/ASRU.2007.4430101
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper investigates unsupervised test-time adaptation of language models (LMs) using discriminative methods for a Mandarin broadcast speech transcription and translation task. A standard approach to adapting interpolated language models is to optimize the component weights by minimizing the perplexity on supervision data. This is a widely made approximation for language modeling in automatic speech recognition (ASR) systems. For speech translation tasks, it is unclear whether a strong correlation still exists between perplexity and the various error cost functions used in the recognition and translation stages. The proposed minimum Bayes risk (MBR) based approach provides a flexible framework for unsupervised LM adaptation and generalizes to a variety of recognition and translation error metrics. LM adaptation is performed at the audio document level using either the character error rate (CER) or the translation edit rate (TER) as the cost function. An efficient parameter estimation scheme using the extended Baum-Welch (EBW) algorithm is proposed. Experimental results on a state-of-the-art speech recognition and translation system are presented. The MBR-adapted language models gave the best recognition and translation performance and reduced the TER score by up to 0.54% absolute.
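The perplexity-based baseline mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's MBR/EBW method; it is the standard EM re-estimation of linear interpolation weights, which minimizes perplexity on supervision text. The function names and the toy probabilities below are hypothetical and chosen only for illustration.

```python
import math


def em_interpolation_weights(component_probs, iterations=50):
    """Estimate linear interpolation weights by EM (perplexity minimization).

    component_probs[t][m] is the probability that component LM m assigns
    to token t of the supervision text, given its history. The values are
    assumed to be strictly positive.
    """
    num_components = len(component_probs[0])
    weights = [1.0 / num_components] * num_components  # uniform start
    for _ in range(iterations):
        counts = [0.0] * num_components
        for probs in component_probs:
            mix = sum(w * p for w, p in zip(weights, probs))
            # Posterior responsibility of each component for this token.
            for m in range(num_components):
                counts[m] += weights[m] * probs[m] / mix
        total = sum(counts)
        weights = [c / total for c in counts]
    return weights


def perplexity(component_probs, weights):
    """Perplexity of the interpolated model on the supervision text."""
    log_sum = 0.0
    for probs in component_probs:
        log_sum += math.log(sum(w * p for w, p in zip(weights, probs)))
    return math.exp(-log_sum / len(component_probs))


# Hypothetical per-token probabilities from two component LMs.
toy_probs = [[0.20, 0.05], [0.10, 0.02], [0.01, 0.30], [0.25, 0.05]]
uniform = [0.5, 0.5]
adapted = em_interpolation_weights(toy_probs)
print("uniform perplexity:", perplexity(toy_probs, uniform))
print("adapted perplexity:", perplexity(toy_probs, adapted))
```

In unsupervised test-time adaptation, the supervision text would be the recognizer's own hypothesis for the audio document. The abstract's point is that lower perplexity on that text is only a proxy for CER or TER, which motivates optimizing the weights against those error metrics directly.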
Pages: 153-158
Page count: 6