Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations

被引:0
|
作者
Pelemans, Joris [1 ]
Vanallemeersch, Tom [2 ]
Demuynck, Kris [3 ]
Van Hamme, Hugo [1 ]
Wambacq, Patrick [1 ]
机构
[1] Katholieke Univ Leuven, ESAT, Leuven, Belgium
[2] Katholieke Univ Leuven, Ctr Computat Linguist, Leuven, Belgium
[3] Univ Ghent, ELIS, B-9000 Ghent, Belgium
关键词
speech recognition; language models; computer-aided translation; machine translation; efficient adaptation; INTEGRATION; DICTATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Direct integration of translation model (TM) probabilities into a language model (LM) with the purpose of improving automatic speech recognition (ASR) of spoken translations typically requires a number of complex operations for each sentence. Many if not all of the LM probabilities need to be updated, the model needs to be renormalized and the ASR system needs to load a new, updated LM for each sentence. In computer-aided translation environments the time loss induced by these complex operations seriously reduces the potential of ASR as an efficient input method. In this paper we present a novel LM adaptation technique that drastically reduces the complexity of each of these operations. The technique consists of LM probability updates using exponential weights based on TM probabilities for each sentence and does not enforce probability renormalization. Instead of storing each resulting language model in its entirety, we only store the update weights which also reduces disk storage and loading time during ASR. Experiments on Dutch read speech translated from English show that both disk storage and recognition time drop dramatically compared to a baseline system that employs a more conventional way of updating the LM.
引用
收藏
页码:2262 / 2266
页数:5
相关论文
共 50 条
  • [41] Spoken Corpora Data, Automatic Speech Recognition, and Bias Against African American Language: The case of Habitual 'Be'
    Martin, Joshua L.
    PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 284 - 284
  • [42] PHONETIC SUBSPACE ADAPTATION FOR AUTOMATIC SPEECH RECOGNITION
    Ghalehjegh, Sina Hamidi
    Rose, Richard C.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7937 - 7941
  • [43] DOMAIN ADAPTATION FOR PARSING IN AUTOMATIC SPEECH RECOGNITION
    Marin, Alex
    Ostendorf, Mari
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [44] Pashto Spoken Digits Database for the Automatic Speech Recognition Research
    Abbas, Arbab Waseem
    Ahmad, Nasir
    Ali, Hazrat
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC 12), 2012, : 348 - 351
  • [45] Automatic Fongbe Phoneme Recognition From Spoken Speech Signal
    Laleye, Frejus A. A.
    Ezin, Eugene C.
    Motamed, Cina
    ICINCO: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL 1, 2016, : 102 - 109
  • [46] A Language Model Optimization Method for Turkish Automatic Speech Recognition System
    Oyucu, Saadin
    Polat, Huseyin
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2023, 26 (03): : 1167 - 1178
  • [47] Unsupervised cross-adaptation approach for speech recognition by combined language model and acoustic model adaptation
    School of Science and Engineering, Yamagata University, Yonezawa, Japan
    APSIPA ASC - Asia-Pac. Signal Inf. Process. Assoc. Annu. Summit Conf., (943-946):
  • [48] ON THE IMPORTANCE OF ANALYTIC PHASE OF SPEECH SIGNALS IN SPOKEN LANGUAGE RECOGNITION
    Vijayan, Karthika
    Li, Haizhou
    Sun, Hanwu
    Lee, Kong Aik
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5194 - 5198
  • [49] Spoken language identification using large vocabulary speech recognition
    Bell Lab, Murray Hill, United States
    Int Conf Spoken Lang Process ICSLP Proc, 1600, (1780-1783):
  • [50] Spoken language recognition -: A step toward multilinguality in speech processing
    Navrátil, J
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (06): : 678 - 685