Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations

Citations: 0
Authors
Pelemans, Joris [1]
Vanallemeersch, Tom [2]
Demuynck, Kris [3]
Van Hamme, Hugo [1]
Wambacq, Patrick [1]
Affiliations
[1] Katholieke Univ Leuven, ESAT, Leuven, Belgium
[2] Katholieke Univ Leuven, Ctr Computat Linguist, Leuven, Belgium
[3] Univ Ghent, ELIS, B-9000 Ghent, Belgium
Keywords
speech recognition; language models; computer-aided translation; machine translation; efficient adaptation; INTEGRATION; DICTATION
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Direct integration of translation model (TM) probabilities into a language model (LM) with the purpose of improving automatic speech recognition (ASR) of spoken translations typically requires a number of complex operations for each sentence. Many, if not all, of the LM probabilities need to be updated, the model needs to be renormalized, and the ASR system needs to load a new, updated LM for each sentence. In computer-aided translation environments, the time loss induced by these complex operations seriously reduces the potential of ASR as an efficient input method. In this paper we present a novel LM adaptation technique that drastically reduces the complexity of each of these operations. The technique consists of LM probability updates using exponential weights based on TM probabilities for each sentence and does not enforce probability renormalization. Instead of storing each resulting language model in its entirety, we store only the update weights, which also reduces disk storage and loading time during ASR. Experiments on Dutch read speech translated from English show that both disk storage and recognition time drop dramatically compared to a baseline system that employs a more conventional way of updating the LM.
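The idea of storing only per-sentence update weights and skipping renormalization can be illustrated with a minimal sketch. The abstract does not give the actual weighting formula, so the form exp(alpha * p_TM(w)), the parameter `alpha`, the function names, and the toy probabilities below are all assumptions made for illustration, not the authors' method.

```python
import math

def update_weights(tm_probs, alpha=2.0):
    """Hypothetical per-sentence weights: exp(alpha * p_TM(w)) for each word the TM proposes.

    Only these sparse weights would be stored per sentence, not a full adapted LM.
    """
    return {word: math.exp(alpha * p) for word, p in tm_probs.items()}

def adapted_score(base_lm, weights, history, word):
    """Baseline LM probability times the stored weight (1.0 if the word has none).

    The result is deliberately left unnormalized, so it is a score, not a probability.
    """
    return base_lm[(history, word)] * weights.get(word, 1.0)

# Toy baseline bigram probabilities and TM probabilities (made-up values).
base_lm = {("het", "huis"): 0.02, ("het", "paard"): 0.01}
tm_probs = {"huis": 0.6}

weights = update_weights(tm_probs)
boosted = adapted_score(base_lm, weights, "het", "huis")
unchanged = adapted_score(base_lm, weights, "het", "paard")
```

In this sketch the baseline LM is loaded once and shared across sentences; only the small `weights` dictionary changes per sentence, which is the source of the storage and loading savings the abstract describes.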
Pages: 2262-2266
Number of pages: 5