Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations

被引：0

作者：

Pelemans, Joris ^{[1
]}

Vanallemeersch, Tom ^{[2
]}

Demuynck, Kris ^{[3
]}

Van Hamme, Hugo ^{[1
]}

Wambacq, Patrick ^{[1
]}

机构：

[1] Katholieke Univ Leuven, ESAT, Leuven, Belgium

[2] Katholieke Univ Leuven, Ctr Computat Linguist, Leuven, Belgium

[3] Univ Ghent, ELIS, B-9000 Ghent, Belgium

来源：

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015年

关键词：

speech recognition; language models; computer-aided translation; machine translation; efficient adaptation; INTEGRATION; DICTATION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Direct integration of translation model (TM) probabilities into a language model (LM) with the purpose of improving automatic speech recognition (ASR) of spoken translations typically requires a number of complex operations for each sentence. Many if not all of the LM probabilities need to be updated, the model needs to be renormalized and the ASR system needs to load a new, updated LM for each sentence. In computer-aided translation environments the time loss induced by these complex operations seriously reduces the potential of ASR as an efficient input method. In this paper we present a novel LM adaptation technique that drastically reduces the complexity of each of these operations. The technique consists of LM probability updates using exponential weights based on TM probabilities for each sentence and does not enforce probability renormalization. Instead of storing each resulting language model in its entirety, we only store the update weights which also reduces disk storage and loading time during ASR. Experiments on Dutch read speech translated from English show that both disk storage and recognition time drop dramatically compared to a baseline system that employs a more conventional way of updating the LM.

引用

页码：2262 / 2266

页数：5

共 50 条

[21] Efficient automatic speech recognition
O'Shaughnessy, D
PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON INTERNET AND MULTIMEDIA SYSTEMS AND APPLICATIONS, 2004, : 323 - 327
[22] Factored Language Model Adaptation Using Dirichlet Class Language Model for Speech Recognition
Hatami, Ali
Akbari, Ahmad
Nasersharif, Babak
2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2013, : 438 - 442
[23] Language model adaptation in speech recognition using document maps
Lagus, K
Kurimo, M
NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, 2002, : 627 - 636
[24] Model adaptation for spoken language understanding
Tur, G
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 41 - 44
[25] Unsupervised Language Model Adaptation by Data Selection for Speech Recognition
Khassanov, Yerbolat
Chong, Tze Yuang
Bigot, Benjamin
Chng, Eng Siong
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 508 - 517
[26] Using Automatic Speech Recognition in Spoken Corpus Curation
Gorisch, Jan
Gref, Michael
Schmidt, Thomas
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6423 - 6428
[27] Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition
Roux, Thibault Baneras
Rouvier, Mickael
Wottawa, Jane
Dufour, Richard
INTERSPEECH 2022, 2022, : 3968 - 3972
[28] A new language model for an automatic Arabic speech recognition system
Rashwan, M.
Journal of Engineering and Applied Science, 2002, 49 (01): : 175 - 193
[29] Universal attribute characterization of spoken languages for automatic spoken language recognition
Siniscalchi, Sabato Marco
Reed, Jeremy
Svendsen, Torbjorn
Lee, Chin-Hui
COMPUTER SPEECH AND LANGUAGE, 2013, 27 (01): : 209 - 227
[30] NORMALIZATION AND ADAPTATION OF SPEECH DATA FOR AUTOMATIC SPEECH RECOGNITION
SCARR, RWA
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1970, 2 (01): : 41 - 59

← 1 2 3 4 5 →