Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations

被引：0

作者：

Pelemans, Joris ^{[1
]}

Vanallemeersch, Tom ^{[2
]}

Demuynck, Kris ^{[3
]}

Van Hamme, Hugo ^{[1
]}

Wambacq, Patrick ^{[1
]}

机构：

[1] Katholieke Univ Leuven, ESAT, Leuven, Belgium

[2] Katholieke Univ Leuven, Ctr Computat Linguist, Leuven, Belgium

[3] Univ Ghent, ELIS, B-9000 Ghent, Belgium

来源：

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015年

关键词：

speech recognition; language models; computer-aided translation; machine translation; efficient adaptation; INTEGRATION; DICTATION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Direct integration of translation model (TM) probabilities into a language model (LM) with the purpose of improving automatic speech recognition (ASR) of spoken translations typically requires a number of complex operations for each sentence. Many if not all of the LM probabilities need to be updated, the model needs to be renormalized and the ASR system needs to load a new, updated LM for each sentence. In computer-aided translation environments the time loss induced by these complex operations seriously reduces the potential of ASR as an efficient input method. In this paper we present a novel LM adaptation technique that drastically reduces the complexity of each of these operations. The technique consists of LM probability updates using exponential weights based on TM probabilities for each sentence and does not enforce probability renormalization. Instead of storing each resulting language model in its entirety, we only store the update weights which also reduces disk storage and loading time during ASR. Experiments on Dutch read speech translated from English show that both disk storage and recognition time drop dramatically compared to a baseline system that employs a more conventional way of updating the LM.

引用

页码：2262 / 2266

页数：5

共 50 条

[1] Chameleon: A Language Model Adaptation Toolkit for Automatic Speech Recognition of Conversational Speech
Song, Yuanfeng
Jiang, Di
Zhao, Weiwei
Xu, Qian
Wong, Raymond Chi-Wing
Yang, Qiang
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2019, : 37 - 42
[2] LSTM LANGUAGE MODEL ADAPTATION WITH IMAGES AND TITLES FOR MULTIMEDIA AUTOMATIC SPEECH RECOGNITION
Moriya, Yasufumi
Jones, Gareth. J. F.
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 219 - 226
[3] Shrinkage Model Adaptation in Automatic Speech Recognition
Li, Jinyu
Tsao, Yu
Lee, Chin-Hui
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1656 - +
[4] AutoSSR: an efficient approach for automatic spontaneous speech recognition model for the Punjabi Language
Yogesh Kumar
Navdeep Singh
Munish Kumar
Amitoj Singh
Soft Computing, 2021, 25 : 1617 - 1630
[5] AutoSSR: an efficient approach for automatic spontaneous speech recognition model for the Punjabi Language
Kumar, Yogesh
Singh, Navdeep
Kumar, Munish
Singh, Amitoj
SOFT COMPUTING, 2021, 25 (02) : 1617 - 1630
[6] LANGUAGE MODEL VERBALIZATION FOR AUTOMATIC SPEECH RECOGNITION
Sak, Hasim
Beaufays, Francoise
Nakajima, Kaisuke
Allauzen, Cyril
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8262 - 8266
[7] Topic identification techniques applied to dynamic language model adaptation for automatic speech recognition
Echeverry-Correa, J. D.
Ferreiros-Lopez, J.
Coucheiro-Limeres, A.
Cordoba, R.
Montero, J. M.
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (01) : 101 - 112
[8] A temporal auditory model with adaptation for automatic speech recognition
Haque, Serajul
Togneri, Roberto
Zaknich, Anthony
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1141 - +
[9] Unsupervised Language Model Adaptation for Automatic Speech Recognition of Broadcast News Using Web 2.0
Schlippe, Tim
Gren, Lukasz
Vu, Ngoc Thang
Schultz, Tanja
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2697 - 2701
[10] Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
Akinori Ito
Yasutomo Kajiura
Motoyuki Suzuki
Shozo Makino
EURASIP Journal on Audio, Speech, and Music Processing, 2009

← 1 2 3 4 5 →