Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations

被引:0
|
作者
Pelemans, Joris [1 ]
Vanallemeersch, Tom [2 ]
Demuynck, Kris [3 ]
Van Hamme, Hugo [1 ]
Wambacq, Patrick [1 ]
机构
[1] Katholieke Univ Leuven, ESAT, Leuven, Belgium
[2] Katholieke Univ Leuven, Ctr Computat Linguist, Leuven, Belgium
[3] Univ Ghent, ELIS, B-9000 Ghent, Belgium
关键词
speech recognition; language models; computer-aided translation; machine translation; efficient adaptation; INTEGRATION; DICTATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Direct integration of translation model (TM) probabilities into a language model (LM) with the purpose of improving automatic speech recognition (ASR) of spoken translations typically requires a number of complex operations for each sentence. Many if not all of the LM probabilities need to be updated, the model needs to be renormalized and the ASR system needs to load a new, updated LM for each sentence. In computer-aided translation environments the time loss induced by these complex operations seriously reduces the potential of ASR as an efficient input method. In this paper we present a novel LM adaptation technique that drastically reduces the complexity of each of these operations. The technique consists of LM probability updates using exponential weights based on TM probabilities for each sentence and does not enforce probability renormalization. Instead of storing each resulting language model in its entirety, we only store the update weights which also reduces disk storage and loading time during ASR. Experiments on Dutch read speech translated from English show that both disk storage and recognition time drop dramatically compared to a baseline system that employs a more conventional way of updating the LM.
引用
收藏
页码:2262 / 2266
页数:5
相关论文
共 50 条
  • [21] Efficient automatic speech recognition
    O'Shaughnessy, D
    PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON INTERNET AND MULTIMEDIA SYSTEMS AND APPLICATIONS, 2004, : 323 - 327
  • [22] Factored Language Model Adaptation Using Dirichlet Class Language Model for Speech Recognition
    Hatami, Ali
    Akbari, Ahmad
    Nasersharif, Babak
    2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2013, : 438 - 442
  • [23] Language model adaptation in speech recognition using document maps
    Lagus, K
    Kurimo, M
    NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, 2002, : 627 - 636
  • [24] Model adaptation for spoken language understanding
    Tur, G
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 41 - 44
  • [25] Unsupervised Language Model Adaptation by Data Selection for Speech Recognition
    Khassanov, Yerbolat
    Chong, Tze Yuang
    Bigot, Benjamin
    Chng, Eng Siong
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 508 - 517
  • [26] Using Automatic Speech Recognition in Spoken Corpus Curation
    Gorisch, Jan
    Gref, Michael
    Schmidt, Thomas
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6423 - 6428
  • [27] Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition
    Roux, Thibault Baneras
    Rouvier, Mickael
    Wottawa, Jane
    Dufour, Richard
    INTERSPEECH 2022, 2022, : 3968 - 3972
  • [28] A new language model for an automatic Arabic speech recognition system
    Rashwan, M.
    Journal of Engineering and Applied Science, 2002, 49 (01): : 175 - 193
  • [29] Universal attribute characterization of spoken languages for automatic spoken language recognition
    Siniscalchi, Sabato Marco
    Reed, Jeremy
    Svendsen, Torbjorn
    Lee, Chin-Hui
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (01): : 209 - 227
  • [30] NORMALIZATION AND ADAPTATION OF SPEECH DATA FOR AUTOMATIC SPEECH RECOGNITION
    SCARR, RWA
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1970, 2 (01): : 41 - 59